This is a technology demonstration showing that tiny (large) language models can run on a cheap, scalable serverless platform. It is not intended for production use.