2016
- Dantzig Selector with an Approximately Optimal Denoising Matrix and its Application to Reinforcement Learning.[pdf]
B Liu, L Zhang, J Liu.
32nd Conference on Uncertainty in Artificial Intelligence (UAI), Jersey City, NJ, 2016
- Proximal Gradient Temporal Difference Learning Algorithms.[pdf]
B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik.
25th International Joint Conference on Artificial Intelligence (IJCAI), New York City, 2016
- Uncorrelated Group LASSO[pdf]
D Kong, J Liu, B Liu, X Bao.
30th AAAI Conference on Artificial Intelligence (AAAI), Phoenix, AZ, Feb 12-17, 2016
2015
- Finite-Sample Analysis of Proximal Gradient TD Algorithms[pdf]
B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik.
31st Conference on Uncertainty in Artificial Intelligence (UAI), Amsterdam, The Netherlands, July 12-16, 2015, Facebook Best Student Paper Award.
2014
- Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces.[pdf]
S Mahadevan, B Liu, P Thomas, W Dabney, S Giguere, N Jacek, I Gemp, J Liu
arXiv preprint arXiv:1405.6757, 2014
- Bluetooth aided mobile phone localization: a nonlinear neural circuit approach.[pdf]
S Li, Y Lou, B Liu
ACM Transactions on Embedded Computing Systems (ACM TECS), 2014
2013
- Selective Positive-Negative Feedback Produces the Winner-Take-All Competition in Recurrent Neural Networks.[pdf]
S Li, B Liu, Y Li
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNN) 24, 301-309, 2013
2012
- Regularized Off-Policy TD-Learning.[pdf]
B Liu, S Mahadevan, J Liu.
26th Annual Conference on Neural Information Processing Systems (NIPS), Lake Tahoe, Nevada, 2012, December 3-6, NIPS Spotlight (5% acceptance).
- Sparse Q-learning with Mirror Descent.[pdf]
S Mahadevan, B Liu.
28th Conference on Uncertainty in Artificial Intelligence (UAI), August 15-17, 2012, Catalina Island, CA.
- Sparse Manifold Alignment.[pdf]
B Liu, C Wang, H Vu, S Mahadevan.
Univ. of Massachusetts Technical Report UM-CS-2012-030.
- Decentralized kinematic control of a class of collaborative redundant manipulators via recurrent neural networks.[pdf]
S Li, S Chen, B Liu, Y Li, Y Liang
Neurocomputing, 2012, One of the most-cited Neurocomputing paper since 2012 according to Scopus
- Neural network based mobile phone localization using Bluetooth connectivity.[pdf]
S Li, B Liu, B Chen, Y Lou
Neural Computing & Applications, 2012
- Decentralized control of collaborative redundant manipulators with partial command coverage via locally connected recurrent neural networks.[pdf]
S Li, H Cui, Y Li, B Liu, Y Lou
Neural Computing & Applications, 1-10, 2012
- A Nonlinear Model to Generate the Winner-take-all Competition.[pdf]
S Li, Y Wang, J Yu, B Liu
Communications in Nonlinear Science and Numerical Simulation, 2012
2011
- Compressive Reinforcement Learning with Oblique Random Projections.[pdf]
B Liu, S Mahadevan.
Univ. of Massachusetts Technical Report UM-CS-2011-024.
2010
- Basis Construction from Power Series Expansions of Value Functions.[pdf]
S Mahadevan, B Liu.
24th Annual Conference on Neural Information Processing Systems (NIPS), Vancouver, B.C., Canada, 2010, December 6-8.
- Two-time-scale online actor-critic paradigm driven by POMDP.[pdf]
B Liu, H He, DW Repperger
International Conference on Networking, Sensing and Control (ICNSC), 2010.
- Adaptive Dual Network Design for a Class of SIMO Systems with Nonlinear Time-variant Uncertainties.[pdf]
B Liu, HB He, S Chen
Acta Automatica Sinica 36 (4), 564-572, 2010