Agreed, I beleive they used that during the training of the model.

An inspiring article!
1
1
Wenjiezhou
Vishal Rajput
·Follow
Sep 17, 2024
--
Agreed, I beleive they used that during the training of the model. During inference, it is just using the trajectories learnt by RL at train time.
--
--
Written by Vishal Rajput
19K Followers
·93 Following
3x🏆Top writer in AI | AI Book 📓: https://rb.gy/xc8m46 | LinkedIn +: https://www.linkedin.com/in/vishal-rajput-999164122/ | 𝕏: https://x.com/RealAIGuys
No responses yet
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams