Computational Automated Learning Lab

2016

Dantzig Selector with an Approximately Optimal Denoising Matrix and its Application to Reinforcement Learning.[pdf]
B Liu, L Zhang, J Liu.
32nd Conference on Uncertainty in Artificial Intelligence (UAI), Jersey City, NJ, 2016
Proximal Gradient Temporal Difference Learning Algorithms.[pdf]
B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik.
25th International Joint Conference on Artificial Intelligence (IJCAI), New York City, 2016
Uncorrelated Group LASSO[pdf]
D Kong, J Liu, B Liu, X Bao.
30th AAAI Conference on Artificial Intelligence (AAAI), Phoenix, AZ, Feb 12-17, 2016

2015

Finite-Sample Analysis of Proximal Gradient TD Algorithms[pdf]
B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik.
31st Conference on Uncertainty in Artificial Intelligence (UAI), Amsterdam, The Netherlands, July 12-16, 2015, Facebook Best Student Paper Award.

2014

Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces.[pdf]
S Mahadevan, B Liu, P Thomas, W Dabney, S Giguere, N Jacek, I Gemp, J Liu
arXiv preprint arXiv:1405.6757, 2014
Bluetooth aided mobile phone localization: a nonlinear neural circuit approach.[pdf]
S Li, Y Lou, B Liu
ACM Transactions on Embedded Computing Systems (ACM TECS), 2014

2013

Selective Positive-Negative Feedback Produces the Winner-Take-All Competition in Recurrent Neural Networks.[pdf]
S Li, B Liu, Y Li
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNN) 24, 301-309, 2013

2012

Regularized Off-Policy TD-Learning.[pdf]
B Liu, S Mahadevan, J Liu.
26th Annual Conference on Neural Information Processing Systems (NIPS), Lake Tahoe, Nevada, 2012, December 3-6, NIPS Spotlight (5% acceptance).
Sparse Q-learning with Mirror Descent.[pdf]
S Mahadevan, B Liu.
28th Conference on Uncertainty in Artificial Intelligence (UAI), August 15-17, 2012, Catalina Island, CA.
Sparse Manifold Alignment.[pdf]
B Liu, C Wang, H Vu, S Mahadevan.
Univ. of Massachusetts Technical Report UM-CS-2012-030.
Decentralized kinematic control of a class of collaborative redundant manipulators via recurrent neural networks.[pdf]
S Li, S Chen, B Liu, Y Li, Y Liang
Neurocomputing, 2012, One of the most-cited Neurocomputing paper since 2012 according to Scopus
Neural network based mobile phone localization using Bluetooth connectivity.[pdf]
S Li, B Liu, B Chen, Y Lou
Neural Computing & Applications, 2012
Decentralized control of collaborative redundant manipulators with partial command coverage via locally connected recurrent neural networks.[pdf]
S Li, H Cui, Y Li, B Liu, Y Lou
Neural Computing & Applications, 1-10, 2012
A Nonlinear Model to Generate the Winner-take-all Competition.[pdf]
S Li, Y Wang, J Yu, B Liu
Communications in Nonlinear Science and Numerical Simulation, 2012

2011

Compressive Reinforcement Learning with Oblique Random Projections.[pdf]
B Liu, S Mahadevan.
Univ. of Massachusetts Technical Report UM-CS-2011-024.

2010

Basis Construction from Power Series Expansions of Value Functions.[pdf]
S Mahadevan, B Liu.
24th Annual Conference on Neural Information Processing Systems (NIPS), Vancouver, B.C., Canada, 2010, December 6-8.
Two-time-scale online actor-critic paradigm driven by POMDP.[pdf]
B Liu, H He, DW Repperger
International Conference on Networking, Sensing and Control (ICNSC), 2010.
Adaptive Dual Network Design for a Class of SIMO Systems with Nonlinear Time-variant Uncertainties.[pdf]
B Liu, HB He, S Chen
Acta Automatica Sinica 36 (4), 564-572, 2010

Computational Automated Learning Laboratory