Pinned

  1. EditAnything Public

    Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

    Python 3.3k 189

  2. envpool Public

    C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

    C++ 1.1k 100

  3. mvp Public

    NeurIPS-2021: Direct Multi-view Multi-person 3D Human Pose Estimation

    Python 329 34

  4. Adan Public

    Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

    Python 752 64

  5. lorahub Public

    [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Python 581 35

  6. zero-bubble-pipeline-parallelism Public

    Forked from NVIDIA/Megatron-LM

    Zero Bubble Pipeline Parallelism

    Python 270 14

Repositories

Showing 10 of 67 repositories
  • Cheating-LLM-Benchmarks Public

    [SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

    Jupyter Notebook 49 MIT 0 0 0 Updated Oct 23, 2024
  • P-DoS Public

    [ArXiv 2024] Denial-of-Service Poisoning Attacks on Large Language Models

    Python 11 2 0 0 Updated Oct 22, 2024
  • CPO Public

    [NeurIPS 2024] The official implementation of the paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.

    Python 49 1 1 0 Updated Oct 18, 2024
  • SimLayerKV Public

    The official implementation of the paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.

    Python 29 0 0 0 Updated Oct 18, 2024
  • Attention-Sink Public

    [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View

    Python 24 MIT 1 0 0 Updated Oct 17, 2024
  • Meta-Unlearning Public
    Python 12 0 0 0 Updated Oct 17, 2024
  • closer-look-LLM-unlearning Public

    The official code of the paper "A Closer Look at Machine Unlearning for Large Language Models".

    Python 9 1 0 0 Updated Oct 11, 2024
  • regmix Public

    🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

    Jupyter Notebook 86 MIT 4 0 0 Updated Oct 3, 2024
  • scaling-with-vocab Public

    [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623

    Python 65 4 1 0 Updated Sep 26, 2024
  • sdft Public

    [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

    Shell 91 4 3 0 Updated Sep 19, 2024
