AI & ML interests

None defined yet.

Recent Activity

qgallouedec updated a dataset about 14 hours ago
trl-lib/trl-download-stats
qgallouedec updated a Space 22 days ago
trl-lib/trl-download-stats
qgallouedec published a Space 22 days ago
trl-lib/trl-download-stats

trl-lib's collections (7)

Comparing DPO with IPO and KTO
A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO.