Login
Workshop

ICML'08 Workshop PASCAL Large Scale Learning Challenge -- July 9, 2008

Topics: Large scale learning; Bounded-resource learning.

PASCAL

Motivation

With the exceptional increase in computing power, storage capacity and network bandwidth of the past decades, ever growing datasets are collected in fields such as bioinformatics (Splice Sites, Gene Boundaries, etc), IT-security (Network traffic) or Text-Classification (Spam vs. Non-Spam), to name but a few. While the data size growth leaves computational methods as the only viable way of dealing with data, it poses new challenges to ML methods.

This workshop is concerned with the scalability and efficiency of existing ML approaches with respect to computational, memory or communication resources, e.g. resulting from a high algorithmic complexity, from the size or dimensionality of the data set, and from the trade-off between distributed resolution and communication costs.

Indeed many comparisons are presented in the literature; however, these usually focus on assessing a few algorithms, or considering a few datasets; further, they most usually involve different evaluation criteria, model parameters and stopping conditions. As a result it is difficult to determine how does a method behave and compare with the other ones in terms of test error, training time and memory requirements, which are the practically relevant criteria.

In the context of the Pascal (Pattern Analysis, Statistical Modelling and Computational Learning) European Network of Excellence, a Challenge is organized to enable a fair and principled assessment of existing large scale classifiers (http://largescale.first.fraunhofer.de).

The Large Scale Learning Workshop at ICML will serve to disseminate the challenge results and announce the winners of the competition. Authors of the best and most original contributions will present their work. Furthermore a panel discussion will be devoted to establishing a principled framework for the validation of large scale learning methods.

Workshop Program (Workshop Day is July 9, 2008; location S14, 3rd floor)

Morning Session:

08:30 - 09:15Welcome and Presentation of Results (Organizers)slides
09:15 - 10:00Ronan Collobert - Large Scale Learning Which Is Actually Usefulslides
10:00 - 10:15Coffee Break
10:15 - 10:35Jochen Garcke - AV SVMabstractslides
10:35 - 11:05Hsiang Fy Yu - liblinearabstractslides
11:05 - 11:35Yossi Richter - Parallel Decision Treeabstractslides

Afternoon Session

14:00 - 14:30Han-Shen Huang and Chun-Nan Hsu - Triple Jump Linear SVMabstractslides
14:30 - 15:00Marc Boulle - Averaging of Selective Naive Bayes Classifiersabstractslides
15:00 - 15:45Chih-Jen Lin - Training Support Vector Machines: Status and Challengesslides
15:45 - 16:00Coffee Break
16:00 - 16:03Kristian Woodsend - Interior Point SVM (presented by Soeren Sonnenburg)abstractslides
16:03 - 16:30Olivier Chapelle, Sathiya Keerthi - SDM SVM L1/2 and Newton SVM (presented by Chih-Jen Lin)abstract, abstractslides
16:30 - 17:00Antoine Bordes - SGD-QN, LaRankabstract, abstractslides
17:00 - 18:00Discussion and Summaryslides

Bold - PASCAL invited speaker

Organizers