Pinned

  1. SWE-agent Public

    SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

    Python 13.4k 1.3k

  2. tree-of-thought-llm Public

    [NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

    Python 4.7k 436

  3. LLM-Shearing Public

    [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

    Python 544 42

  4. SWE-bench Public

    [ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

    Python 1.8k 311

  5. SimCSE Public

    [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

    Python 3.4k 511

  6. MeZO Public

    [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

    Python 1k 62

Repositories

Showing 10 of 81 repositories
  • HELMET Public
    Python 16 MIT 1 0 0 Updated Oct 4, 2024
  • ProLong Public

    Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"

    26 MIT 0 1 0 Updated Oct 4, 2024
  • SWE-agent Public

    SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

    Python 13,402 MIT 1,336 54 15 Updated Oct 3, 2024
  • benign-data-breaks-safety Public
    Python 15 0 0 0 Updated Oct 1, 2024
  • NLProofS Public

    [EMNLP 2022] Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443

    Python 81 MIT 15 0 0 Updated Sep 15, 2024
  • MQuAKE Public

    [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

    Jupyter Notebook 97 MIT 5 7 0 Updated Sep 12, 2024
  • AutoCompressors Public

    [EMNLP 2023] Adapting Language Models to Compress Long Contexts

    Python 273 20 6 0 Updated Sep 9, 2024
  • WebShop Public

    [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

    Python 261 MIT 55 4 1 Updated Sep 6, 2024
  • SWE-bench Public

    [ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

    Python 1,813 MIT 311 20 9 Updated Sep 3, 2024
  • SimPO Public

    SimPO: Simple Preference Optimization with a Reference-Free Reward

    Python 665 MIT 42 12 0 Updated Aug 22, 2024