Showing 1–9 of 9 results for author: Liaw, R

Searching in archive cs.
  1. arXiv:2001.02338

    cs.DC cs.LG

    HyperSched: Dynamic Resource Reallocation for Model Development on a Deadline

    Authors: Richard Liaw, Romil Bhardwaj, Lisa Dunlap, Yitian Zou, Joseph Gonzalez, Ion Stoica, Alexey Tumanov

    Abstract: Prior research in resource scheduling for machine learning training workloads has largely focused on minimizing job completion times. Commonly, these model training workloads collectively search over a large number of parameter values that control the learning process in a hyperparameter search. It is preferable to identify and maximally provision the best-performing hyperparameter configuration (…

    Submitted 7 January, 2020; originally announced January 2020.

  2. arXiv:1912.00167

    cs.LG stat.ML

    IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

    Authors: Michael Luo, Jiahao Yao, Richard Liaw, Eric Liang, Ion Stoica

    Abstract: The practical usage of reinforcement learning agents is often bottlenecked by the duration of training time. To accelerate training, practitioners often turn to distributed reinforcement learning architectures to parallelize and accelerate the training process. However, modern methods for scalable reinforcement learning (RL) often tradeoff between the throughput of samples that an RL agent can lea…

    Submitted 23 January, 2020; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: ICLR 2020 Publication; 14 pages, 10 figures

  3. arXiv:1810.01021

    cs.LG cs.AI math.OC stat.ML

    Large batch size training of neural networks with adversarial training and second-order information

    Authors: Zhewei Yao, Amir Gholami, Daiyaan Arfeen, Richard Liaw, Joseph Gonzalez, Kurt Keutzer, Michael Mahoney

    Abstract: The most straightforward method to accelerate Stochastic Gradient Descent (SGD) computation is to distribute the randomly selected batch of inputs over multiple processors. To keep the distributed processors fully utilized requires commensurately growing the batch size. However, large batch training often leads to poorer generalization. A recently proposed solution for this problem is to use adapt…

    Submitted 2 January, 2020; v1 submitted 1 October, 2018; originally announced October 2018.

  4. arXiv:1807.05118

    cs.LG cs.DC stat.ML

    Tune: A Research Platform for Distributed Model Selection and Training

    Authors: Richard Liaw, Eric Liang, Robert Nishihara, Philipp Moritz, Joseph E. Gonzalez, Ion Stoica

    Abstract: Modern machine learning algorithms are increasingly computationally demanding, requiring specialized hardware and distributed computation to achieve high performance in a reasonable time frame. Many hyperparameter search algorithms have been proposed for improving the efficiency of model selection, however their adaptation to the distributed compute environment is often ad-hoc. We propose Tune, a…

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 8 Pages, Presented at the 2018 ICML AutoML workshop

  5. arXiv:1712.09381

    cs.AI cs.DC cs.LG

    RLlib: Abstractions for Distributed Reinforcement Learning

    Authors: Eric Liang, Richard Liaw, Philipp Moritz, Robert Nishihara, Roy Fox, Ken Goldberg, Joseph E. Gonzalez, Michael I. Jordan, Ion Stoica

    Abstract: Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular computation patterns, each of which typically exhibits opportunities for distributed computation. We argue for distributing RL components in a composable way by adapting algorithms for top-down hierarchical control, thereby encapsulating parallelism and resource requirements within short-running compute tasks. We d…

    Submitted 28 June, 2018; v1 submitted 26 December, 2017; originally announced December 2017.

    Comments: Published in the International Conference on Machine Learning (ICML 2018), 10 pages

  6. arXiv:1712.05889

    cs.DC cs.AI cs.LG stat.ML

    Ray: A Distributed Framework for Emerging AI Applications

    Authors: Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I. Jordan, Ion Stoica

    Abstract: The next generation of AI applications will continuously interact with the environment and learn from these interactions. These applications impose new and demanding systems requirements, both in terms of performance and flexibility. In this paper, we consider these requirements and present Ray, a distributed system to address them. Ray implements a unified interface that can express both task-pa…

    Submitted 29 September, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

    Comments: 17 pages, 14 figures, 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

  7. arXiv:1711.01503

    cs.AI

    Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning

    Authors: Richard Liaw, Sanjay Krishnan, Animesh Garg, Daniel Crankshaw, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Rather than learning new control policies for each new task, it is possible, when tasks share some structure, to compose a "meta-policy" from previously learned policies. This paper reports results from experiments using Deep Reinforcement Learning on a continuous-state, discrete-action autonomous driving simulator. We explore how Deep Neural Networks can represent meta-policies that switch among…

    Submitted 4 November, 2017; originally announced November 2017.

    Comments: 8 pages, 11 figures

  8. arXiv:1703.03924

    cs.DC cs.AI cs.LG

    Real-Time Machine Learning: The Missing Pieces

    Authors: Robert Nishihara, Philipp Moritz, Stephanie Wang, Alexey Tumanov, William Paul, Johann Schleier-Smith, Richard Liaw, Mehrdad Niknami, Michael I. Jordan, Ion Stoica

    Abstract: Machine learning applications are increasingly deployed not only to serve predictions using static models, but also as tightly-integrated components of feedback loops involving dynamic, real-time decision making. These applications pose a new set of requirements, none of which are difficult to achieve in isolation, but the combination of which creates a challenge for existing distributed execution…

    Submitted 19 May, 2017; v1 submitted 11 March, 2017; originally announced March 2017.

    Comments: 6 pages, 3 figures

  9. arXiv:1604.06508

    cs.RO

    HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards

    Authors: Sanjay Krishnan, Animesh Garg, Richard Liaw, Lauren Miller, Florian T. Pokorny, Ken Goldberg

    Abstract: Reinforcement Learning (RL) struggles in problems with delayed rewards, and one approach is to segment the task into sub-tasks with incremental rewards. We propose a framework called Hierarchical Inverse Reinforcement Learning (HIRL), which is a model for learning sub-task structure from demonstrations. HIRL decomposes the task into sub-tasks based on transitions that are consistent across demonst…

    Submitted 21 April, 2016; originally announced April 2016.

    Comments: 12 pages