dGSLM

dGSLM is a project for textless speech modeling using Fairseq. This guide will help you set up the environment and install the necessary dependencies.

Environment Setup

Follow the steps below to set up the environment using conda and pip:

1. Create and activate a Conda environment

conda create -n dGSLM python=3.9
conda activate dGSLM

2. Install PyTorch and dependencies

Install PyTorch, torchvision, and torchaudio with CUDA 12.1 support:conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia

3. Upgrade `pip` and install required packages

pip install pip==23.3.1
pip install omegaconf==2.0.6

4. Install Fairseq

Clone the Fairseq repository and install it in editable mode:

git clone [email protected]:facebookresearch/fairseq.git
cd fairseq
pip install --editable ./

5. Install additional dependencies

pip install soundfile librosa

6. Run Inference

python src/inference.py

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
config		config
fairseq @ ecbf110		fairseq @ ecbf110
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
out_wav_0123.wav		out_wav_0123.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dGSLM

Environment Setup

1. Create and activate a Conda environment

2. Install PyTorch and dependencies

3. Upgrade `pip` and install required packages

4. Install Fairseq

5. Install additional dependencies

6. Run Inference

About

Releases

Packages

Languages

Berkeley-Speech-Group/dGSLM

Folders and files

Latest commit

History

Repository files navigation

dGSLM

Environment Setup

1. Create and activate a Conda environment

2. Install PyTorch and dependencies

3. Upgrade pip and install required packages

4. Install Fairseq

5. Install additional dependencies

6. Run Inference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

3. Upgrade `pip` and install required packages

Packages