PhD in Reinforcement Learning & Deep Learning
- Mila Quebec AI Institute
- Montreal, CA
- halixness.github.io
- https://scholar.google.com/citations?user=0xjQqX4AAAAJ&hl=en
Pinned
- lucidrains/metacontroller (Public)
Implementation of the MetaController proposed in "Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning"
- llama-titans (Public, forked from lucidrains/titans-pytorch)
Adaptation of titans-pytorch to Llama models on Hugging Face
- refgen (Public)
Official implementation of "Discovery of Sustainable Refrigerants through Physics-Informed RL Fine-Tuning of Sequence Models" by Goldszal, Calanzone et al. 2025, presented at SIMBIOCHEM @ EurIPS 2025
Python
