Nathan Kallus

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

◀ ▶ joint publications with Masatoshi Uehara

> Home > Persons > Nathan Kallus

Publications

2025
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/mor/HuKU25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mor/HuKU25
Yichun Hu, Nathan Kallus, Masatoshi Uehara:
Fast Rates for the Regret of Offline Reinforcement Learning. Math. Oper. Res. 50(1): 633-655 (2025)
2024
[j19]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/KallusMU24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/KallusMU24
Nathan Kallus, Xiaojie Mao, Masatoshi Uehara:
Localized Debiased Machine Learning: Efficient Inference on Quantile Treatment Effects and Beyond. J. Mach. Learn. Res. 25: 16:1-16:59 (2024)
[c68]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ZhanUKL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhanUKL024
Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun:
Provable Offline Preference-Based Reinforcement Learning. ICLR 2024
2023
[c53]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/BennettKMNSU23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/BennettKMNSU23
Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara:
Inference on Strongly Identified Functionals of Weakly Identified Functions. COLT 2023: 2265
[c52]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/BennettKMNSU23a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/BennettKMNSU23a
Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara:
Minimax Instrumental Variable Regression and L₂ Convergence Guarantees without Identification or Closedness. COLT 2023: 2291-2318
[c49]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/UeharaSLK023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/UeharaSLK023
Masatoshi Uehara, Ayush Sekhari, Jason D. Lee, Nathan Kallus, Wen Sun:
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings. ICML 2023: 34615-34641
[c47]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/UeharaKBC0KS023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/UeharaKBC0KS023
Masatoshi Uehara, Haruka Kiyohara, Andrew Bennett, Victor Chernozhukov, Nan Jiang, Nathan Kallus, Chengchun Shi, Wen Sun:
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs. NeurIPS 2023
[c46]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/UeharaKL023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/UeharaKL023
Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun:
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage. NeurIPS 2023
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-02392
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-02392
Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun:
Refined Value-Based Offline RL under Realizability and Partial Coverage. CoRR abs/2302.02392 (2023)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-05404
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-05404
Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara:
Minimax Instrumental Variable Regression and $L_2$ Convergence Guarantees without Identification or Closedness. CoRR abs/2302.05404 (2023)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-14816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14816
Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun:
Provable Offline Reinforcement Learning with Human Feedback. CoRR abs/2305.14816 (2023)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-13793
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-13793
Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara:
Source Condition Double Robust Inference on Functionals of Inverse Problems. CoRR abs/2307.13793 (2023)
2022
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/ior/KallusU22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ior/KallusU22
Nathan Kallus, Masatoshi Uehara:
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning. Oper. Res. 70(6): 3282-3302 (2022)
[c37]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/UeharaSLK022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/UeharaSLK022
Masatoshi Uehara, Ayush Sekhari, Jason D. Lee, Nathan Kallus, Wen Sun:
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems. NeurIPS 2022
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-12020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-12020
Masatoshi Uehara, Ayush Sekhari, Jason D. Lee, Nathan Kallus, Wen Sun:
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems. CoRR abs/2206.12020 (2022)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-12081
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-12081
Masatoshi Uehara, Ayush Sekhari, Jason D. Lee, Nathan Kallus, Wen Sun:
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings. CoRR abs/2206.12081 (2022)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-13081
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-13081
Masatoshi Uehara, Haruka Kiyohara, Andrew Bennett, Victor Chernozhukov, Nan Jiang, Nathan Kallus, Chengchun Shi, Wen Sun:
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs. CoRR abs/2207.13081 (2022)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-06355
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-06355
Masatoshi Uehara, Chengchun Shi, Nathan Kallus:
A Review of Off-Policy Evaluation in Reinforcement Learning. CoRR abs/2212.06355 (2022)
2021
[c35]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/HuKU21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/HuKU21
Yichun Hu, Nathan Kallus, Masatoshi Uehara:
Fast Rates for the Regret of Offline Reinforcement Learning. COLT 2021: 2462
[c33]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KallusSU21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KallusSU21
Nathan Kallus, Yuta Saito, Masatoshi Uehara:
Optimal Off-Policy Evaluation from Multiple Logging Policies. ICML 2021: 5247-5256
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-00479
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-00479
Yichun Hu, Nathan Kallus, Masatoshi Uehara:
Fast Rates for the Regret of Offline Reinforcement Learning. CoRR abs/2102.00479 (2021)
[i50]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-02981
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-02981
Masatoshi Uehara, Masaaki Imaizumi, Nan Jiang, Nathan Kallus, Wen Sun, Tengyang Xie:
Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency. CoRR abs/2102.02981 (2021)
[i49]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-14029
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-14029
Nathan Kallus, Xiaojie Mao, Masatoshi Uehara:
Causal Inference Under Unmeasured Confounding With Negative Controls: A Minimax Learning Approach. CoRR abs/2103.14029 (2021)
2020
[j5]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/KallusU20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/KallusU20
Nathan Kallus, Masatoshi Uehara:
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes. J. Mach. Learn. Res. 21: 167:1-167:63 (2020)
[c25]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KallusU20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KallusU20
Nathan Kallus, Masatoshi Uehara:
Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation. ICML 2020: 5078-5088
[c24]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KallusU20a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KallusU20a
Nathan Kallus, Masatoshi Uehara:
Statistically Efficient Off-Policy Policy Gradients. ICML 2020: 5089-5100
[c23]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KallusU20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KallusU20
Nathan Kallus, Masatoshi Uehara:
Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies. NeurIPS 2020
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-04014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-04014
Nathan Kallus, Masatoshi Uehara:
Statistically Efficient Off-Policy Policy Gradients. CoRR abs/2002.04014 (2020)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-03886
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-03886
Nathan Kallus, Masatoshi Uehara:
Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning. CoRR abs/2006.03886 (2020)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-03900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-03900
Nathan Kallus, Masatoshi Uehara:
Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies. CoRR abs/2006.03900 (2020)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11002
Nathan Kallus, Yuta Saito, Masatoshi Uehara:
Optimal Off-Policy Evaluation from Multiple Logging Policies. CoRR abs/2010.11002 (2020)
2019
[c18]
- view
- export record
  dblp key:
  - conf/nips/KallusU19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KallusU19
Nathan Kallus, Masatoshi Uehara:
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning. NeurIPS 2019: 3320-3329
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-03735
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-03735
Nathan Kallus, Masatoshi Uehara:
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning. CoRR abs/1906.03735 (2019)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-08526
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-08526
Nathan Kallus, Masatoshi Uehara:
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes. CoRR abs/1908.08526 (2019)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-05850
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-05850
Nathan Kallus, Masatoshi Uehara:
Efficiently Breaking the Curse of Horizon: Double Reinforcement Learning in Infinite-Horizon Processes. CoRR abs/1909.05850 (2019)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-12945
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-12945
Nathan Kallus, Xiaojie Mao, Masatoshi Uehara:
Localized Debiased Machine Learning: Efficient Estimation of Quantile Treatment Effects, Conditional Value at Risk, and Beyond. CoRR abs/1912.12945 (2019)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.