Search Portage & Overlays:
Newest
News
Repository news
GLSAs
Browse
USE Flags
Overlays
More...
dev-python
/trl-fpo
Train transformer language models with reinforcement learning.
Screenshots
https://github.com/huggingface/trl
trl-fpo-0.0.15
~amd64 ~x86
benchmark deepspeed dev diffusers llm-judge peft quantization test python_targets_python3_11 python_targets_python3_12 python_targets_python3_13 python_targets_python3_14
View
Download
Browse
License: Apache-2.0
Overlay:
pypi
ChangeLog
USE Flags
Dependencies
Reverse Deps
Related Bugs