tech, startups, internet

What is ChatGPT doing and why does it work?

Written by Stephen Wolfram, this is a technical explanation about how ChatGPT and Large Language Models work. He builds up from the ground floor, providing simplified mathematical examples that are somewhat approachable to regular people. But it is Stephen Wolfram, who is […]

tech, startups, internet

And here come multimodal Chain of Thought.

Amazon researchers paired text with images as input to drive LLM ability ever higher. And they did it with a model that’s only 770 million params (vs the 175 Billion in GPT-3.5). “Our method achieves new state-of-the-art performance on the […]