This repository contains the Jupyter Notebook used for fine-tuning the Mistral-7B model on the Alpaca dataset. The fine-tuned model, optimized for various NLP tasks, is available on Hugging Face.
- Model Name: Mistral-7B (4-bit Quantized)
- Hugging Face Model Page: Mistral-7B (4-bit)
- Dataset: Alpaca Cleaned Link
- Data Preparation: Load and prepare the Alpaca dataset.
- Model Training: Fine-tune the Mistral-7B model with 4-bit quantization.
- Evaluation: Assess the model's performance.
- Inference: Generate predictions using the fine-tuned model.
To set up the environment and run the notebook, follow these steps:
-
Install Conda: Ensure you have Conda installed. If not, download and install it from Anaconda or Miniconda.
-
Create and Activate a New Conda Environment:
conda create -n mistral-finetune python=3.10 conda activate mistral-finetune
-
Install PyTorch and CUDA Dependencies:
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
-
Install Additional Packages:
pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git" pip install --no-deps "trl<0.9.0" peft accelerate bitsandbytes
-
Verify PyTorch and CUDA Installation:
python -c "import torch; print(torch.version.cuda)"
For Pytorch 2.3.0 and newer RTX 30xx GPUs or higher, use:
pip install "unsloth[cu121-ampere-torch230] @ git+https://github.com/unslothai/unsloth.git"
-
Install TensorFlow: For GPU users:
pip install tensorflow[and-cuda]
For CPU users:
pip install tensorflow
-
Verify TensorFlow Installation:
python3 -c "import tensorflow as tf; print(tf.config.list_physical_devices('GPU'))"
If a list of GPU devices is returned, TensorFlow is installed successfully.
-
Download the Notebook: Clone or download this repository to your local machine.
-
Open the Jupyter Notebook: Launch Jupyter Notebook from the environment:
jupyter notebook
-
Execute the Cells: Follow the instructions in the notebook to reproduce the fine-tuning process.
Feel free to explore the notebook, run experiments, and provide any feedback or contributions. For questions or issues, please open an issue on this repository or contact me directly. Happy experimenting!