Stars
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Official Repository of ChatCaptioner
VideoLLM: Modeling Video Sequence with Large Language Models
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
PyTorch implementation of Barlow Twins.
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Stock options, RSUs, taxes — read the latest edition: www.holloway.com/ec
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Supplementary code for our paper: Heavy-tailed noise does not explain the gap between SGD and Adam on Transformers
The codebase for the paper "A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks"
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Magenta: Music and Art Generation with Machine Intelligence