createos

createos

How are you handling the full deployment lifecycle for AI workloads in production?

Curious how other teams are approaching this.

Building an AI app used to mean picking a model and writing product logic. Now it means also picking a hosting provider, wiring up a monitoring tool, and at some point figuring out billing. Three separate systems, each with their own failure modes, each needing maintenance.

The pattern I keep seeing: teams ship something that works in staging, then spend the next month firefighting the infrastructure around it. A monitoring alert lags the actual incident by 10+ minutes. The billing integration breaks when usage spikes. The hosting layer that worked for a prototype can not handle real traffic.

Some specific questions for anyone running AI workloads in production:

  • Are you managing hosting, monitoring, and billing as separate systems or have you consolidated them?

  • If separate, how much engineering time per week goes into keeping those integrations running vs. building the actual product?

  • Have you looked at managed execution layers as an alternative to self-building this stack?

We ran into this problem ourselves while building CreateOS (createos.sh), which ended up being our answer to it. But I am more interested in how others are solving it, or whether the problem is even the same across different team sizes.

Where Next?

Popular General Dev topics Top

KnowledgeIsPower
I had written an article about container sand-boxing. Please give comments or feedback, thanks! Sandboxing containers
New
KnowledgeIsPower
I had written an article about Migrate a K3S cluster storage from Rook to OpenEBS, with Velero Please give comment or feedback, thanks!
New
KnowledgeIsPower
I had written an article about Using mcrouter and memcached as caching layer for Thanos Store Please give comment or feedback, thanks!
New
New
avinashmeena
What are the benefits of using Linux hosting for small businesses? How does Linux hosting compare to other hosting solutions in terms of ...
New
akinihsan
I am trying to use following kubernetes ingress service to host dashboard in the server. it doesnt host the administration dashboard. How...
New
akinihsan
Hello Do you use static code analysiss tool on your ci cd pipeline? Which one do u use and what are the benefits for you
New
tasaraskam
Hey everyone, I’m a DevOps engineer and I noticed that most developers either don’t know DevOps or hate setting up CI/CD and Docker. So...
New
runmyjob
Hi, We’re building EU-based CI runners with load-based billing at RunMyJob and I’d love to benchmark them against real-world CI setups. ...
New
createos
Curious how other teams are approaching this. Building an AI app used to mean picking a model and writing product logic. Now it means al...
New

Other popular topics Top

PragmaticBookshelf
Take your Go skills to the next level by learning how to design, develop, and deploy a distributed service. Start from the bare essential...
New
PragmaticBookshelf
Andy and Dave wrote this influential, classic book to help their clients create better software and rediscover the joy of coding. Almost ...
New
PragmaticBookshelf
Free and open source software is the default choice for the technologies that run our world, and it’s built and maintained by people like...
New
PragmaticBookshelf
Write Elixir tests that you can be proud of. Dive into Elixir’s test philosophy and gain mastery over the terminology and concepts that u...
New
AstonJ
If you want a quick and easy way to block any website on your Mac using Little Snitch simply… File > New Rule: And select Deny, O...
New
New
PragmaticBookshelf
Develop, deploy, and debug BEAM applications using BEAMOps: a new paradigm that focuses on scalability, fault tolerance, and owning each ...
New
CommunityNews
A Brief Review of the Minisforum V3 AMD Tablet. Update: I have created an awesome-minisforum-v3 GitHub repository to list information fo...
New
AstonJ
Curious what kind of results others are getting, I think actually prefer the 7B model to the 32B model, not only is it faster but the qua...
New
xiji2646-netizen
Woke up to this today: Claude Code’s complete source code exposed via npm source map. Not a snippet. All 512,000 lines. 1,900 TypeScript ...
New