Goal: To create the best possible Amharic language models that can run offline on a budget smartphone.
All resources created from this repository will be stored in amharic_llama huggingface collection.
Unless otherwise stated, notebooks through Google Colab are the primary supported way of running everything here.
- Create a robust evaluation framework to understand current Amharic LLM capabilities.
- Develop the SoA Amharic language models for <3B and <10B parameters.
- Develop an offline AI app to enable broad distribution for Ethiopians running on low RAM hardware.
- Progress to Oromo, Tigrinya, Afar and Somali languages.
Objective: Understand the current SOA in Amharic
- Fine tune various SOA english LLMs on Amharic translated datasets to provide a set of control models.
- Develop an open Amharic LLM Leaderboard for model evaluation
- Translate Evaluation Datasets - Amharic GSM8K, Amharic TruthfulQA, ARC, HellaSwag, MMLU, Winogrande
- Create lm-evaluation-harness tasks for each translated dataset
- Create an Amharic LLM leaderboard to visualise all existing model capabilities.
Objective: Use EEVE vocabulary expansion to train the SOA Amharic language model.