🌸 Zephyr-7B LoRA SFT - Taiwanese HR Instruction Fine-tuned Model 中文職場 HR 指令微調版 (zephyr-7b-lora-debby-v1)

This model is a LoRA fine-tuned version of HuggingFaceH4/zephyr-7b-beta, trained on a small set of custom-curated, instruction-style prompts written in Traditional Chinese, focused on human resource (HR) scenarios in Taiwan.

It was developed as part of a larger vision to build human-centered AI systems that can assist real-world organizational workflows — from resume screening to labor law guidance — especially in Taiwanese and Mandarin-speaking environments.

本模型以 HuggingFaceH4/zephyr-7b-beta 為基礎,進行少量繁體中文資料的 Supervised Fine-tuning(SFT),資料主題涵蓋台灣職場情境下的 HR 指令回應、招募分析、履歷判讀與職涯建議。

此為實驗性模型,主要用途為多模態 AI 系統中的語言理解模組、Chatbot 原型驗證與人本 NLP 任務應用。

🔧 Training Configuration / 訓練細節

  • 🧠 Base model: zephyr-7b-beta
  • 🛠️ Fine-tuning method: PEFT (LoRA)
  • 🎯 Target modules: ["q_proj", "v_proj"]
  • 🔢 Precision: FP16 (no int8/4bit quantization)
  • ⏳ Epochs: 1
  • 🧾 Dataset: Self-built instruction-format dataset (JSONL)
  • 💻 Platform: Google Colab + A100 GPU

📝 Sample Input Format (JSONL) / 樣例格式

Each training example includes instruction, input, and output. The final training data was converted into a single text field, like this:

{
"instruction": "請協助我評估這份履歷與 JD 的相符程度,並簡要說明優勢與不足。",
"input": "履歷:具三年 Python 資料分析經驗、熟悉 SQL 與 Tableau。\\\\nJD:需要資料分析能力,需熟悉 Python、SQL,有儀表板經驗佳。",
"output": "相符程度:高度相符。\\\\n優勢:履歷中具備 Python 與 SQL 能力,並有 Tableau 經驗,符合 JD 要求。\\\\n不足:JD 若有提及特定產業經驗(如金融/製造),履歷中未提及。",
"category": "HR-1"
}

-for English researchers, here's the translation of sample training example:

{
"instruction:": "Please evaluate the resume against the job description and briefly explain the strengths and weaknesses.",
"input": "Resume: 3 years of Python data analysis experience, familiar with SQL and Tableau.\\\\nJD: Requires data analysis skills, Python/SQL, dashboard experience preferred.",
"output": "Match level: High.\\\\nStrengths: Candidate has Python and SQL skills and Tableau experience, which matches JD.\\\\nWeaknesses: Lacks specific industry experience if JD requires it (e.g., finance or manufacturing).",
"category": "HR-1"}

✅ Use Cases / 適用場景

  • Chinese instruction-following agent / 中文聊天機器人
  • AI assistants for recruitment and workplace advisory / HR FAQ招募建議與履歷對話
  • Natural Language modules within multimodal HR analytics systems / 多模態 AI 系統之文字理解模組(如搭配 Whisper、DeepFace)

🚧 Limitations & Disclaimer / 聲明

This model was trained with a small custom dataset and should not be used for legal or HR decisions without human supervision. It's best used for: - Prototype building - NLP pipeline experimentation - Human-centered AI co-creation research

本模型為研究測試用途,訓練資料來源為自建模擬資料,不應用於正式法律或人力判斷情境。建議搭配人類判讀驗證再應用於實務。

🙏 Acknowledgements

  • Hugging Face PEFT, TRL, and Transformers teams
  • Google Colab
  • Taiwan’s HR field, which inspired this effort
  • The inner vow that AI should empower both systems and souls

If you are a researcher, engineer, or HR innovator interested in co-developing fair and human-centered AI tools, feel free to reach out via LinkedIn or my portfolio.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DEBBY-YEH/zephyr-7b-lora-debby-v1

Finetuned
(141)
this model

Space using DEBBY-YEH/zephyr-7b-lora-debby-v1 1