gpo.zugaina.org

dev-python/trl-fpo

Train transformer language models with reinforcement learning.

  • trl-fpo-0.0.15
    ~amd64 ~x86
    benchmark deepspeed dev diffusers llm-judge peft quantization test python_targets_python3_11 python_targets_python3_12 python_targets_python3_13 python_targets_python3_14

    License: Apache-2.0
    Overlay: pypi
