Chain of Thoughtlessness.
This research makes the claim that using “Chain of Thought” prompting for LLMs does not generalize. I consider this evidence of LLM limitations (at least for now?). “This spotlights drawbacks of chain of thought, especially because of the sharp tradeoff between possible performance gains and the amount of human labor necessary to generate examples with correct reasoning traces.” ~ learn more