(subjectif, tbc)
Tutorials
- Langford, Structured prediction. http://www.hunch.net/~l2s/
Invited talk: Bottou +++
- 2 learning modules in interaction (one explores, one classifies): misleading effects (exploration results are bad on average, therefore no need to explore, we were right the first time)
- test: must be reconsidered. (consider the tails of distribution and coverage)
Papers
- Unsupervised Domain Adaptation by Backpropagation Yaroslav Ganin, Victor Lempitsky
- two objectives on features: being discriminative wrt class; not discriminant wrt source/target
- Learning Transferable Features with Deep Adaptation Networks Mingsheng Long, Yue Cao, Jianmin Wang, Michael Jordan
- related to the previous + kernels.
- Strongly Adaptive Online Learning Amit Daniely, Alon Gonen, Shai Shalev-Shwartz
- Online and adaptive weights on experts + intervals + doubling trick
- Adaptive Belief Propagation Georgios Papachristoudis, John Fisher
- Question for me: why not considering several spanning trees...
- Weight Uncertainty in Neural Network Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, Daan Wierstra
- weight = Gaussian; + Bayes by backprop (Graves).
- Gradient-based Hyperparameter Optimization through Reversible Learning, Dougal Maclaurin, David Duvenaud, Ryan Adams
- derivative of misclassification wrt hyper-parameters.
- On Symmetric and Asymmetric LSHs for Inner Product Search Behnam Neyshabur, Nathan Srebro
- Different random projections for queries and for solutions.
- The Ladder: A Reliable Leaderboard for Machine Learning Competitions Avrim Blum, Moritz Hardt
- validation set, test set, multiple trials.
- Learning to Search Better than Your Teacher Kai-Wei Chang, Akshay Krishnamurthy, Alekh Agarwal, Hal Daume, John Langford
- ??
- Learning Fast-Mixing Models for Structured Prediction Jacob Steinhardt, Percy Liang
??
Papers where I think one could do otherwise
- On Identifying Good Options under Combinatorially Structured Feedback in Finite Noisy Environments Yifan Wu, Andras Gyorgy, Csaba Szepesvari
- MCTS ?
Papers where I must have missed something (because otherwise, ...)
- Coordinate Descent Converges Faster with the Gauss-Southwell Rule Than Random Selection Julie Nutini, Mark Schmidt, Issam Laradji, Michael Friedlander, Hoyt Koepke
- not compared to BFGS !!!
Papiers anciens que j'avais manqués
- Label-Embedding for Attribute-Based Classification (embarrassingly simple).
Ce qui peut interesser...
Cécile: (rapport avec la thèse de Dawei).
On Identifying Good Options under Combinatorially Structured Feedback in Finite Noisy Environments
Yifan Wu, Andras Gyorgy, Csaba Szepesvari