Thanks for the information. I know Google had TPU custom made a long time ago, a...

		UI_at_80x24 on Sept 26, 2024 \| parent \| context \| favorite \| on: OpenAI to become for-profit company Thanks for the information. I know Google had TPU custom made a long time ago, and that the concept has existed for a LONG TIME. I assumed that a technical hurdle (i.e. VRAM) was finally behind allowing this theoretical (1 token/sec on a CPU vs 100 tokens/sec on a GPU) to become reasonable. Thanks for the links too!