Skip to content

Instantly share code, notes, and snippets.

@fangkuoyu
Last active April 7, 2024 15:39
Show Gist options
  • Save fangkuoyu/e74fdfb5833a6b443d04659ff2f7972a to your computer and use it in GitHub Desktop.
Save fangkuoyu/e74fdfb5833a6b443d04659ff2f7972a to your computer and use it in GitHub Desktop.
Model Data Training Method Library Colab Local
Transformer PEFT TRL
TinyLLaMA Finetuning PEFT STFTrainer
LoRA SFTTrainer
QLoRA SFTTrainer T4 1660Ti
Alignment RLHF/PPO PPOTrainer
DPO DPOTrainer 1660Ti
GPT2 IMDB Aligment RLHF/PPO yes no PPOTrainer T4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment