ICML 2011, The 28th International Conference on Machine Learning

Program

Awards

In honor of its 25th anniversary, the Machine Learning Journal is sponsoring the awards for the student authors of the best and distinguished papers. Good job, and congratulations!

Best Paper

The Best Paper award goes to Kevin Waugh, Brian Ziebart and Drew Bagnell for Computational Rationalization: The Inverse Equilibrium Problem.

Distinguished Paper Awards

Abhimanyu Das and David Kempe: Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection

Miguel Lazaro-Gredilla and Michalis Titsias: Variational Heteroscedastic Gaussian Process Regression

Jascha Sohl-Dickstein, Peter Battaglino, and Michael DeWeese: Minimum Probability Flow Learning

Distinguished Application Paper Awards

Lauren Hannah and David Dunson: Approximate Dynamic Programming for Storage Problems

Sean Gerrish and David Blei: Predicting Legislative Roll Calls from Text

Richard Socher, Cliff Chiung-Yu Lin, Andrew Ng, and Chris Manning: Parsing Natural Scenes and Natural Language with Recursive Neural Networks

Test-of-Time Award

This award is given to papers that time and hindsight have proved to be of lasting value to the machine learning community. This year, we are recognizing the seminal paper on CRFs: John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data.

Printed Proceedings

You can purchase the printed proceedings online for $100 and pick them up during the conference at the registration office, aka the Grand Ballroom Coat Check.

Online Proceedings

These are the abstracts of the accepted papers. You can download the whole proceedings (87MB zip) and the summary bibfile.

Hashing with Graphs

Wei Liu, Jun Wang, Sanjiv Kumar, Shih-Fu Chang

Abstract: Hashing is becoming increasingly popular for efficient nearest neighbor search in massive databases. However, learning short codes that yield good search performance is still a challenge. Moreover, in many cases real-world data lives on a low-dimensional manifold, which should be taken into account to capture meaningful nearest neighbors. In this paper, we propose a novel graph-based hashing method which automatically discovers the neighborhood structure inherent in the data to learn appropriate compact codes. To make such an approach computationally feasible, we utilize Anchor Graphs to obtain tractable low-rank adjacency matrices. Our formulation allows constant time hashing of a new data point by extrapolating graph Laplacian eigenvectors to eigenfunctions. Finally, we describe a hierarchical threshold learning procedure in which each eigenfunction yields multiple bits, leading to higher search accuracy. Experimental comparison with the other state-of-the-art methods on two large datasets demonstrates the efficacy of the proposed method.
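As a rough illustration of the pipeline this abstract outlines (anchor-based affinities, a small eigenproblem, and sign thresholding of the extrapolated eigenfunctions), here is a minimal NumPy sketch. It is not the authors' implementation. The function names, the number of nearest anchors `s`, the kernel `bandwidth`, and the returned dictionary are all assumptions made for illustration; anchors are passed in by the caller, and the hierarchical threshold learning step is omitted, so each eigenfunction yields a single bit.

```python
import numpy as np


def anchor_affinities(X, anchors, s=2, bandwidth=1.0):
    """Return a row-stochastic (n, m) matrix linking each point to its s nearest anchors."""
    # Squared Euclidean distance between every point and every anchor.
    d2 = ((X[:, None, :] - anchors[None, :, :]) ** 2).sum(axis=2)
    nearest = np.argsort(d2, axis=1)[:, :s]
    rows = np.arange(X.shape[0])[:, None]
    # Shift by the smallest distance so the closest anchor always gets weight 1
    # (this avoids underflow and cancels out after row normalization).
    shifted = d2 - d2.min(axis=1, keepdims=True)
    Z = np.zeros(d2.shape)
    Z[rows, nearest] = np.exp(-shifted[rows, nearest] / bandwidth)
    return Z / Z.sum(axis=1, keepdims=True)


def fit_anchor_hash(X, anchors, n_bits, s=2, bandwidth=1.0):
    """Learn a projection W so that sign(Z @ W) gives n_bits-bit codes (illustrative only)."""
    n, m = X.shape[0], anchors.shape[0]
    if not 0 < n_bits < m:
        raise ValueError("n_bits must be between 1 and m - 1")
    Z = anchor_affinities(X, anchors, s, bandwidth)
    # Anchor degrees; the low-rank adjacency is Z @ diag(1 / degree) @ Z.T.
    inv_sqrt_deg = 1.0 / np.sqrt(np.maximum(Z.sum(axis=0), 1e-12))
    Zs = Z * inv_sqrt_deg                      # scale each column
    M = Zs.T @ Zs                              # small (m, m) symmetric matrix
    evals, evecs = np.linalg.eigh(M)           # eigenvalues in ascending order
    # Skip the largest eigenvalue (the trivial one, equal to 1) and keep the next n_bits.
    keep = np.arange(m - 2, m - 2 - n_bits, -1)
    vals = np.maximum(evals[keep], 1e-12)
    W = np.sqrt(n) * (inv_sqrt_deg[:, None] * evecs[:, keep]) / np.sqrt(vals)
    return {"anchors": anchors, "W": W, "s": s, "bandwidth": bandwidth}


def encode(X_new, model):
    """Hash points using only the m anchors, so the cost does not depend on the training set size."""
    Z_new = anchor_affinities(X_new, model["anchors"], model["s"], model["bandwidth"])
    return (Z_new @ model["W"]) > 0
```

In this sketch a caller would pick the anchors (for example, a random subset of the data or cluster centers), call `fit_anchor_hash` once, and then call `encode` on unseen points. Only the m-by-m eigenproblem is solved, which is what keeps the graph computation tractable.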

Contents

Awards

Best Paper

Distinguished Paper Awards

Distinguished Application Paper Awards

Test-of-Time Award

Printed Proceedings

Online Proceedings

Hashing with Graphs

Efficient Sparse Modeling with Automatic Feature Grouping

Multi-Label Classification on Tree- and DAG-Structured Hierarchies

A Graph-based Framework for Multi-Task Multi-View Learning

GoDec: Randomized Low-rank & Sparse Matrix Decomposition in Noisy Case

Unimodal Bandits

Learning Output Kernels with Block Coordinate Descent

Vector-valued Manifold Regularization

On Information-Maximization Clustering: Tuning Parameter Selection and Analytic Solution

On tracking portfolios with certainty equivalents on a generalization of Markowitz model: the Fool, the Wise and the Adaptive

Multiple Instance Learning with Manifold Bags

Eigenvalue Sensitive Feature Selection

Large Scale Text Classification using Semi-supervised Multinomial Naive Bayes

Enhanced Gradient and Adaptive Learning Rate for Training Restricted Boltzmann Machines

Dynamic Tree Block Coordinate Ascent

Implementing regularization implicitly via approximate eigenvector computation

Parsing Natural Scenes and Natural Language with Recursive Neural Networks

Conjugate Markov Decision Processes

Learning Mallows Models with Pairwise Preferences

Surrogate losses and regret bounds for cost-sensitive classification with example-dependent costs

Efficient Rule Ensemble Learning using Hierarchical Kernels

An Augmented Lagrangian Approach to Constrained MAP Inference

Mean-Variance Optimization in Markov Decision Processes

Time Series Clustering: Complex is Simpler!

Max-margin Learning for Lower Linear Envelope Potentials in Binary Markov Random Fields

Inference of Inversion Transduction Grammars

BCDNPKL: Scalable Non-Parametric Kernel Learning Using Block Coordinate Descent

Learning Discriminative Fisher Kernels

Pruning nearest neighbor cluster trees

Online AUC Maximization

Beat the Mean Bandit

Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning

Estimating the Bayes Point Using Linear Knapsack Problems

On optimization methods for deep learning

Multiclass Classification with Bandit Feedback using Adaptive Regularization

On the Necessity of Irrelevant Variables

ABC-EP: Expectation Propagation for Likelihood-free Bayesian Computation

A PAC-Bayes Sample-compression Approach to Kernel Methods

Integrating Partial Model Knowledge in Model Free RL Algorithms

Fast Newton-type Methods for Total Variation Regularization

Parallel Coordinate Descent for L1-Regularized Loss Minimization

Large-Scale Convex Minimization with a Low-Rank Constraint

Approximate Dynamic Programming for Storage Problems

Online Submodular Minimization for Combinatorial Structures

Minimal Loss Hashing for Compact Binary Codes

The Hierarchical Beta Process for Convolutional Factor Analysis and Deep Learning

Simultaneous Learning and Covering with Adversarial Noise

Topic Modeling with Nonparametric Markov Tree

Relational Active Learning for Joint Collective Classification Models

A Co-training Approach for Multi-view Spectral Clustering

Learning from Multiple Outlooks

Adaptive Kernel Approximation for Large-Scale Non-Linear SVM Prediction

Risk-Based Generalizations of f-divergences

Learning Multi-View Neighborhood Preserving Projections

Better Algorithms for Selective Sampling

Minimax Learning Rates for Bipartite Ranking and Plug-in Rules

Task Space Retrieval Using Inverse Feedback Control

Bayesian CCA via Group Sparsity

PILCO: A Model-Based and Data-Efficient Approach to Policy Search

Suboptimal Solution Path Algorithm for Support Vector Machine

Incremental Basis Construction from Temporal Difference Error

Predicting Legislative Roll Calls from Text

On Bayesian PCA: Automatic Dimensionality Selection and Analytic Solution

Learning Linear Functions with Quadratic and Linear Multiplicative Updates

Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach

Learning with Whom to Share in Multi-task Feature Learning

Boosting on a Budget: Sampling for Feature-Efficient Prediction

Speeding-Up Hoeffding-Based Regression Trees With Options

Linear Regression under Fixed-Rank Constraints: A Riemannian Approach

Cauchy Graph Embedding

Uncovering the Temporal Dynamics of Diffusion Networks

Multiclass Boosting with Hinge Loss based on Output Coding

Functional Regularized Least Squares Classification with Operator-valued Kernels