This project demonstrates how to fine-tune the Google Gemma 2 2B model to improve its performance on Japanese instruction-following tasks. It uses the Hugging Face ecosystem, including the `transformers`, `datasets`, and `trl` libraries, to fine-tune the model efficiently with the QLoRA (Quantized Low-Rank Adaptation) technique.
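As an illustrative sketch (not the notebook's exact code), the core QLoRA setup loads the base model in 4-bit precision via `BitsAndBytesConfig` and attaches LoRA adapters on top. The checkpoint id `google/gemma-2-2b` and the specific quantization/LoRA values below are assumptions; the notebook's actual settings may differ.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig

# 4-bit NF4 quantization keeps the 2B model small enough for a single consumer GPU.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_id = "google/gemma-2-2b"  # assumed checkpoint; the notebook may use another Gemma 2 variant
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# Only low-rank adapters are trained instead of the full weights (the "LoRA" part of QLoRA).
lora_config = LoraConfig(
    r=16,                      # adapter rank (illustrative value)
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```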
- Fine-tuning Google Gemma 2 2B model for Japanese language tasks
- Utilization of QLoRA for efficient fine-tuning
- Dataset preparation and formatting for instruction tuning
- Integration with Hugging Face's `transformers` and `trl` libraries
- Model evaluation and inference examples
- PyTorch
- Transformers
- Datasets
- TRL (Transformer Reinforcement Learning)
- Accelerate
- PEFT (Parameter-Efficient Fine-Tuning)
- BitsAndBytes
- Prepare your dataset:
  - The notebook uses the "Mustain/JapaneseQADataset" from Hugging Face, but you can replace it with your own dataset.
  - Ensure your dataset is in the correct format (conversation or instruction format); a formatting sketch follows this list.
- Set up your environment:
  - Make sure you have access to a GPU for faster training.
  - Set your Hugging Face token for accessing the Gemma model (see the login line in the sketch below).
- Run the notebook:
  - Follow the steps in the notebook to load the model, prepare the dataset, and start the fine-tuning process.
- Evaluate the model:
  - Use the provided evaluation code to test your fine-tuned model on new Japanese instructions.
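A minimal sketch of the dataset-preparation and token-setup steps above, assuming the dataset exposes `question` and `answer` columns (hypothetical names, adjust to your data's schema) and that examples are rendered with Gemma's `<start_of_turn>` chat markers:

```python
import os
from huggingface_hub import login
from datasets import load_dataset

# Authenticate so the gated Gemma weights can be downloaded.
# HF_TOKEN is an assumed environment variable name; set it to your Hugging Face token.
login(token=os.environ.get("HF_TOKEN"))

dataset = load_dataset("Mustain/JapaneseQADataset", split="train")

# Hypothetical column names -- change "question" / "answer" to match your dataset.
def to_gemma_chat(example):
    text = (
        f"<start_of_turn>user\n{example['question']}<end_of_turn>\n"
        f"<start_of_turn>model\n{example['answer']}<end_of_turn>\n"
    )
    return {"text": text}

dataset = dataset.map(to_gemma_chat)
```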
- Model: Google Gemma 2 2B
- Fine-tuning Method: QLoRA (Quantized Low-Rank Adaptation)
- Training Framework: TRL's SFTTrainer
- Dataset: Japanese Q&A dataset (customizable)
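A rough outline of how these pieces combine with TRL's `SFTTrainer` (argument names vary across `trl` versions, so treat this as a sketch rather than the notebook's exact code; all hyperparameter values are illustrative):

```python
from transformers import TrainingArguments
from trl import SFTTrainer

training_args = TrainingArguments(
    output_dir="gemma2-2b-ja-qlora",   # illustrative output path
    per_device_train_batch_size=2,
    gradient_accumulation_steps=4,
    learning_rate=2e-4,
    num_train_epochs=1,
    logging_steps=10,
    bf16=True,
)

trainer = SFTTrainer(
    model=model,               # quantized base model from the setup sketch above
    train_dataset=dataset,     # formatted dataset with a "text" column
    peft_config=lora_config,   # LoRA adapter configuration
    args=training_args,        # newer trl versions may expect an SFTConfig here instead
)
trainer.train()
trainer.save_model("gemma2-2b-ja-qlora")
```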
The notebook demonstrates how the fine-tuned model improves in following Japanese instructions compared to the base model. Specific results may vary based on your dataset and training parameters.
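For a quick qualitative check, you can compare base and fine-tuned outputs on a held-out Japanese prompt. A minimal generation sketch (the prompt below is just an example) looks like:

```python
prompt = (
    "<start_of_turn>user\n"
    "日本で一番高い山は何ですか？\n"   # "What is the tallest mountain in Japan?"
    "<end_of_turn>\n<start_of_turn>model\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```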
You can easily adapt this notebook for other languages or specific domains by:
- Changing the base model (e.g., to Gemma 2 9B or other models)
- Using a different dataset relevant to your task
- Adjusting hyperparameters in the `TrainingArguments` and `LoraConfig`
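For example, swapping the checkpoint and adjusting the adapter and training settings only takes a few lines; the values below are illustrative assumptions, not recommendations:

```python
from transformers import TrainingArguments
from peft import LoraConfig

model_id = "google/gemma-2-9b"          # larger base model (requires more GPU memory)

lora_config = LoraConfig(
    r=32,                                # higher rank = more trainable parameters
    lora_alpha=64,
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="gemma2-9b-domain-sft",   # illustrative output path
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    learning_rate=1e-4,
    num_train_epochs=3,
)
```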
This project is licensed under the MIT License - see the LICENSE file for details.