Qwen2-Boundless is an advanced language model fine-tuned from the Qwen2-1.5B-Instruct model. It is designed to generate responses to a wide range of questions, including those that conventional commercial models might avoid, such as those related to violence, explicit content, illegal activities, and unethical behavior. Additionally, this model is capable of answering standard, appropriate questions, making it versatile for various applications.
We've open-sourced this model on Hugging Face. Click here to explore.
- Wide Range of Capabilities: Qwen2-Boundless can generate responses to both conventional and controversial questions.
- Specialized Datasets: The model is fine-tuned using Bad_Data.json, which includes content on violence, explicit materials, illegal activities, and unethical behavior, as well as a dataset derived from cleaning and organizing data from Clouditera/SecGPT/..., which focuses on cybersecurity issues.
- Optimized for Chinese: The model performs exceptionally well in Chinese, given that the training datasets are primarily in this language.
- Fine-Tuning Framework: The model was fine-tuned using the LLaMA-Factory project.
For details on how to use the model, refer to the following example scripts:
- Basic Usage: basic_usage.py
- Continuous Conversation: continuous_conversation.py
- Streamed Output: streamed_output.py
- Model Name: Qwen2-Boundless
- Base Model: Qwen2-1.5B-Instruct
- Datasets:
- Language: Primarily optimized for Chinese
- Updated on 2024.8.22: For security reasons, the current dataset is an abridged version. See Bad_Data.json.
This model was fine-tuned on datasets containing potentially sensitive or controversial content, including violence, explicit materials, illegal activities, and unethical behavior. Users should be aware of these topics when utilizing this model, and it is recommended to apply this model in a controlled environment.
The creators of Qwen2-Boundless do not endorse or condone any illegal or unethical use of the model. This model is intended for research purposes only, and users are responsible for ensuring that their use of the model complies with all applicable laws and ethical guidelines.
This project is licensed under the Apache 2.0 License. See the LICENSE file for details.
Special thanks to the developers of the Qwen2-1.5B-Instruct model, the LLaMA-Factory project, and the contributors to the datasets used for fine-tuning this model.