This repository contains example handlers and a requirements.txt for hosting a customized Llama model.
Create a conda environment if you don't already have one:
conda create --name hf_inference python=3.10
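Then activate the environment and install the dependencies (this assumes the requirements.txt sits at the repository root):

conda activate hf_inference
pip install -r requirements.txt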
To build the custom inference handler, follow the guide at https://www.philschmid.de/custom-inference-handler
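Following that guide, a custom handler lives in a handler.py that defines an EndpointHandler class. The sketch below is only illustrative; it assumes a plain transformers text-generation pipeline rather than this repository's actual handler, and the model path is a placeholder.

```python
# handler.py -- illustrative sketch of a custom inference handler
# (assumes a text-generation pipeline; adapt to the actual Llama checkpoint)
from typing import Any, Dict, List

from transformers import pipeline


class EndpointHandler:
    def __init__(self, path: str = ""):
        # `path` points to the model checkpoint on disk
        self.pipeline = pipeline("text-generation", model=path)

    def __call__(self, data: Dict[str, Any]) -> List[Dict[str, Any]]:
        # The request body arrives as a dict with an "inputs" key and
        # optional generation "parameters"
        inputs = data.get("inputs", "")
        parameters = data.get("parameters", {})
        return self.pipeline(inputs, **parameters)
```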
Run the local smoke test, pinning it to a single GPU (CUDA_VISIBLE_DEVICES="1" selects the second GPU):

CUDA_VISIBLE_DEVICES="1" python -m test_handler
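This command runs the repository's test_handler module. A minimal sketch of what such a script might look like, assuming the handler.py above and a placeholder prompt and model path:

```python
# test_handler.py -- illustrative local smoke test for the handler
# (the model path and prompt are placeholders, not the repository's actual values)
from handler import EndpointHandler

if __name__ == "__main__":
    # Instantiate the handler against a local model directory
    handler = EndpointHandler(path=".")

    # Simulate the JSON payload the hosted endpoint would receive
    payload = {"inputs": "Hello, Llama!", "parameters": {"max_new_tokens": 32}}
    print(handler(payload))
```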