Instructions to use tencent/HY-MT1.5-7B-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use tencent/HY-MT1.5-7B-GGUF with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "translation" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("translation", model="tencent/HY-MT1.5-7B-GGUF") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("tencent/HY-MT1.5-7B-GGUF", dtype="auto") - llama-cpp-python
How to use tencent/HY-MT1.5-7B-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="tencent/HY-MT1.5-7B-GGUF", filename="HY-MT1.5-7B-Q4_K_M.gguf", )
llm.create_chat_completion( messages = "\"Меня зовут Вольфганг и я живу в Берлине\"" )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use tencent/HY-MT1.5-7B-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf tencent/HY-MT1.5-7B-GGUF:Q4_K_M
Use Docker
docker model run hf.co/tencent/HY-MT1.5-7B-GGUF:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use tencent/HY-MT1.5-7B-GGUF with Ollama:
ollama run hf.co/tencent/HY-MT1.5-7B-GGUF:Q4_K_M
- Unsloth Studio
How to use tencent/HY-MT1.5-7B-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tencent/HY-MT1.5-7B-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tencent/HY-MT1.5-7B-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for tencent/HY-MT1.5-7B-GGUF to start chatting
- Docker Model Runner
How to use tencent/HY-MT1.5-7B-GGUF with Docker Model Runner:
docker model run hf.co/tencent/HY-MT1.5-7B-GGUF:Q4_K_M
- Lemonade
How to use tencent/HY-MT1.5-7B-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull tencent/HY-MT1.5-7B-GGUF:Q4_K_M
Run and chat with the model
lemonade run user.HY-MT1.5-7B-GGUF-Q4_K_M
List all available models
lemonade list
🤗 Hugging Face | 🕹️ Demo 🤖 ModelScope |
🖥️ Official Website | Github
Model Introduction
Hunyuan Translation Model Version 1.5 includes a 1.8B translation model, HY-MT1.5-1.8B, and a 7B translation model, HY-MT1.5-7B. Both models focus on supporting mutual translation across 33 languages and incorporating 5 ethnic and dialect variations. Among them, HY-MT1.5-7B is an upgraded version of our WMT25 championship model, optimized for explanatory translation and mixed-language scenarios, with newly added support for terminology intervention, contextual translation, and formatted translation. Despite having less than one-third the parameters of HY-MT1.5-7B, HY-MT1.5-1.8B delivers translation performance comparable to its larger counterpart, achieving both high speed and high quality. After quantization, the 1.8B model can be deployed on edge devices and support real-time translation scenarios, making it widely applicable.
Key Features and Advantages
- HY-MT1.5-1.8B achieves the industry-leading performance among models of the same size, surpassing most commercial translation APIs.
- HY-MT1.5-1.8B supports deployment on edge devices and real-time translation scenarios, offering broad applicability.
- HY-MT1.5-7B, compared to its September open-source version, has been optimized for annotated and mixed-language scenarios.
- Both models support terminology intervention, contextual translation, and formatted translation.
Related News
- 2025.12.30, we have open-sourced HY-MT1.5-1.8B and HY-MT1.5-7B on Hugging Face.
- 2025.9.1, we have open-sourced Hunyuan-MT-7B , Hunyuan-MT-Chimera-7B on Hugging Face.
Performance
Model Links
| Model Name | Description | Download |
|---|---|---|
| HY-MT1.5-1.8B | Hunyuan 1.8B translation model | 🤗 Model |
| HY-MT1.5-1.8B-FP8 | Hunyuan 1.8B translation model, fp8 quant | 🤗 Model |
| HY-MT1.5-1.8B-GPTQ-Int4 | Hunyuan 1.8B translation model, int4 quant | 🤗 Model |
| HY-MT1.5-7B | Hunyuan 7B translation model | 🤗 Model |
| HY-MT1.5-7B-FP8 | Hunyuan 7B translation model, fp8 quant | 🤗 Model |
| HY-MT1.5-7B-GPTQ-Int4 | Hunyuan 7B translation model, int4 quant | 🤗 Model |
Prompts
Prompt Template for ZH<=>XX Translation.
将以下文本翻译为{target_language},注意只需要输出翻译后的结果,不要额外解释:
{source_text}
Prompt Template for XX<=>XX Translation, excluding ZH<=>XX.
Translate the following segment into {target_language}, without additional explanation.
{source_text}
Prompt Template for terminology intervention.
参考下面的翻译:
{source_term} 翻译成 {target_term}
将以下文本翻译为{target_language},注意只需要输出翻译后的结果,不要额外解释:
{source_text}
Prompt Template for contextual translation.
{context}
参考上面的信息,把下面的文本翻译成{target_language},注意不需要翻译上文,也不要额外解释:
{source_text}
Prompt Template for formatted translation.
将以下<source></source>之间的文本翻译为中文,注意只需要输出翻译后的结果,不要额外解释,原文中的<sn></sn>标签表示标签内文本包含格式信息,需要在译文中相应的位置尽量保留该标签。输出格式为:<target>str</target>
<source>{src_text_with_format}</source>
Use with llama.cpp
llama-cli -hf tencent/HY-MT1.5-7B-GGUF:Q8_0 -p "Translate the following segment into Chinese, without additional explanation.\n\nIt’s on the house." -n 4096 --temp 0.7 --top-k 20 --top-p 0.6 --repeat-penalty 1.05 --no-warmup
Use with ollama
echo 'FROM hf.co/tencent/HY-MT1.5-7B-GGUF:Q8_0\nTEMPLATE """{{ if .System }}<|startoftext|>{{ .System }}<|extra_4|>{{ end }}{{ if .Prompt }}<|startoftext|>{{ .Prompt }}<|extra_0|>{{ end }}{{ .Response }}<|eos|>"""' > Modelfile
ollama create hy-mt1.5-7b -f Modelfile
ollama run hy-mt1.5-7b
We recommend using the following set of parameters for inference. Note that our model does not have the default system_prompt.
{
"top_k": 20,
"top_p": 0.6,
"repetition_penalty": 1.05,
"temperature": 0.7
}
Supported languages:
| Languages | Abbr. | Chinese Names |
|---|---|---|
| Chinese | zh | 中文 |
| English | en | 英语 |
| French | fr | 法语 |
| Portuguese | pt | 葡萄牙语 |
| Spanish | es | 西班牙语 |
| Japanese | ja | 日语 |
| Turkish | tr | 土耳其语 |
| Russian | ru | 俄语 |
| Arabic | ar | 阿拉伯语 |
| Korean | ko | 韩语 |
| Thai | th | 泰语 |
| Italian | it | 意大利语 |
| German | de | 德语 |
| Vietnamese | vi | 越南语 |
| Malay | ms | 马来语 |
| Indonesian | id | 印尼语 |
| Filipino | tl | 菲律宾语 |
| Hindi | hi | 印地语 |
| Traditional Chinese | zh-Hant | 繁体中文 |
| Polish | pl | 波兰语 |
| Czech | cs | 捷克语 |
| Dutch | nl | 荷兰语 |
| Khmer | km | 高棉语 |
| Burmese | my | 缅甸语 |
| Persian | fa | 波斯语 |
| Gujarati | gu | 古吉拉特语 |
| Urdu | ur | 乌尔都语 |
| Telugu | te | 泰卢固语 |
| Marathi | mr | 马拉地语 |
| Hebrew | he | 希伯来语 |
| Bengali | bn | 孟加拉语 |
| Tamil | ta | 泰米尔语 |
| Ukrainian | uk | 乌克兰语 |
| Tibetan | bo | 藏语 |
| Kazakh | kk | 哈萨克语 |
| Mongolian | mn | 蒙古语 |
| Uyghur | ug | 维吾尔语 |
| Cantonese | yue | 粤语 |
- Downloads last month
- 6,392
4-bit
6-bit
8-bit