Highlights
- Pro
Stars
EvaByte: Efficient Byte-level Language Models at Scale
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Class-Conditional self-reward mechanism for improved Text-to-Image models
A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.
A suite of image and video neural tokenizers
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Large Concept Models: Language modeling in a sentence representation space
A general fine-tuning kit geared toward diffusion models.
PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
COYO-700M: Large-scale Image-Text Pair Dataset
A generative world for general-purpose robotics & embodied AI learning.
A bibliography and survey of the papers surrounding o1
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
[TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts