stat946F18: Difference between revisions

From statwiki
Jump to navigation Jump to search
Line 67: Line 67:
|NOv 27 ||Xudong Peng  || 9||  ||  ||  
|NOv 27 ||Xudong Peng  || 9||  ||  ||  
|-
|-
|Nov 27 ||Xinyue Zhang  || 10|| A Distributional Perspective on Reinforcement Learning || [http://proceedings.mlr.press/v70/bellemare17a/bellemare17a.pdf Paper] ||  
|Nov 27 ||Xinyue Zhang  || 10|| An Inference-Based Policy Gradient Method for Learning Options || [http://proceedings.mlr.press/v80/smith18a/smith18a.pdf Paper] ||  
|-
|-
|NOv 29 ||Junyi Zhang  || 11||  ||  ||  
|NOv 29 ||Junyi Zhang  || 11||  ||  ||  
|-
|-
|Nov 29 ||Travis Bender  || 12|| Automatic Goal Generation for Reinforcement Learning Agents || [http://proceedings.mlr.press/v80/florensa18a/florensa18a.pdf Paper]  ||
|Nov 29 ||Travis Bender  || 12|| Automatic Goal Generation for Reinforcement Learning Agents || [http://proceedings.mlr.press/v80/florensa18a/florensa18a.pdf Paper]  ||

Revision as of 11:18, 4 October 2018

Paper presentation

Date Name Paper number Title Link to the paper Link to the summary
Feb 15 (example) Ri Wang Sequence to sequence learning with neural networks. Paper Summary
Oct 25 1
Oct 25 2
Oct 25 3
Oct 30 Manpreet Singh Minhas 1
Oct 30 Marvin Pafla 2
Nov 1 Sriram Ganapathi Subramanian 1 Mean Field Multi-Agent Reinforcement Learning Paper
Nov 1 Hadi Nekoei 1
Nov 1 Henry Chen 1 DeepVO: Towards end-to-end visual odometry with deep Recurrent Convolutional Neural Networks Paper
Nov 6 Nargess Heydari 2
Nov 6 Aravind Ravi 3
Nov 8 Neel Bhatt 1 [TBD]
Nov 8 Jacob Manuel 2
Nov 8 Charupriya Sharma 2
NOv 13 Sagar Rajendran 1 Zero-Shot Visual Imitation Paper
Nov 13 Jiazhen Chen 2
Nov 13 Neil Budnarain 2
NOv 15 Zheng Ma 3 Reinforcement Learning of Theorem Proving Paper
Nov 15 Abdul Khader Naik 4
Nov 15 Johra Muhammad Moosa 2
NOv 20 Zahra Rezapour Siahgourabi 19
Nov 20 Shubham Koundinya 6
Nov 20 Salman Khan Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples paper
NOv 22 Soroush Ameli 22
Nov 22 Ivan Li 23 Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate Paper
Nov 22 Sigeng Chen 2
Nov 27 Aileen Li 8 Spatially Transformed Adversarial Examples Paper
NOv 27 Xudong Peng 9
Nov 27 Xinyue Zhang 10 An Inference-Based Policy Gradient Method for Learning Options Paper
NOv 29 Junyi Zhang 11
Nov 29 Travis Bender 12 Automatic Goal Generation for Reinforcement Learning Agents Paper