Project Template Generation

The challenge is to generate a project template: a small, compilable project that can be described in 1-5 sentences and contains small examples of all the mentioned libraries, technologies, and functionality.

Dataset

Projects from GitHub written in Java and Kotlin with 10+ stars, 10+ lines of code, and permissive licences, excluding forks (collected via https://seart-ghs.si.usi.ch), filtered by is_template=True or by the presence of template-related keywords in the description. Among the Java and Kotlin projects, Android projects were identified by the android token in the description or tags and moved to a separate category.
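
The selection criteria above can be sketched roughly as follows. This is only an illustration, not the actual collection script: the `Repo` record, its field names, the `TEMPLATE_KEYWORDS` list, and the category labels are all assumptions, since the README does not specify the keyword list or the data schema.

```python
import re
from dataclasses import dataclass, field

# Hypothetical keyword list; the README does not enumerate the real one.
TEMPLATE_KEYWORDS = ("template", "boilerplate", "starter", "skeleton")


@dataclass
class Repo:
    # Hypothetical record layout, loosely modelled on repository search metadata.
    name: str
    language: str
    stars: int
    code_lines: int
    is_fork: bool
    is_template: bool
    description: str = ""
    topics: list = field(default_factory=list)
    permissive_license: bool = True


def is_candidate(repo: Repo) -> bool:
    """Apply the language, size, popularity, licence, and template filters."""
    if repo.language not in ("Java", "Kotlin"):
        return False
    if repo.stars < 10 or repo.code_lines < 10:
        return False
    if repo.is_fork or not repo.permissive_license:
        return False
    description = repo.description.lower()
    has_keyword = any(keyword in description for keyword in TEMPLATE_KEYWORDS)
    return repo.is_template or has_keyword


def category(repo: Repo) -> str:
    """Return 'android' if the android token appears in description or tags,
    otherwise the lower-cased language name."""
    tokens = set(re.findall(r"[a-z0-9]+", repo.description.lower()))
    tokens |= {topic.lower() for topic in repo.topics}
    return "android" if "android" in tokens else repo.language.lower()
```

For example, a Kotlin repository with 25 stars and the description "Android app template with Compose" would pass `is_candidate` and be placed in the `android` category, while a Java repository with 5 stars would be dropped regardless of its description.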

The collected data is available on HuggingFace 🤗. It was manually labeled in Google Sheets to select the test subset.