About the job
...training data distribution. Preference Model creates reinforcement learning environments that encapsulate real-world use cases, enabling... ...for the models.
About the Role
We're seeking experienced ML engineers to build distributed training infrastructure for our RL...
Read more on Jooble →