Created
March 9, 2024 14:08
Self reasoning framework
Given that training methods like "Pass@K" training and "Dr. GRPO" aim to de-bias how LLMs reason, I wonder whether the same ideas also work at the prompt-engineering level. The art would lie in knowing which method to use for which type of task. https://github.com/RUCAIBox/Passk_Training https://github.com/sail-sg/understand-r1-zero