跳转到主要内容

AITraining home page

GitHub
GitHub

GitHub
PyPI
Discord

训练技术

提示词蒸馏
DPO 训练
ORPO 训练
PPO 训练
GRPO 训练
奖励建模
RL 训练模块

优化

超参数扫描
量化
LoRA & PEFT
Unsloth 集成
Flash Attention
Gradient checkpointing

自定义开发

Custom metrics
Custom losses
Custom trainers
Custom datasets

评估

Evaluation framework
Benchmark suites
Metric computation
Model comparison

研究

Experiment tracking
Ablation studies
Reproducibility
Paper implementations

生产

Scaling training
分布式训练
Model optimization
Serving at scale

404

Page Not Found

We couldn't find the page. Maybe you were looking for one of these pages below?

Reward Modeling DPO Training Training Tasks