Existing generative AI models are built on batch processing: You give the system instructions; it runs computations; then ...