aralroca

aralroca

AI agents shouldn't control your apps; they should BE the app

I wrote about a different approach to AI agents. Instead of building bots that screenshot your screen and click buttons (like OpenAI Operator or Anthropic Computer Use), I built 304 browser tools where the AI agent IS the app; calling local functions directly instead of navigating UIs.
Everything runs client-side: JavaScript for simple tasks, Rust compiled to WASM for heavy computation, and ONNX/TensorFlow.js models for AI features like background removal.

Full article:

https://kitmul.com/en/blog/building-kitmul-ai-tools-os

Curious what you all think; is the “AI controlling existing apps” approach fundamentally flawed, or am I missing something?

Where Next?

Popular Ai topics Top

tonyxrandall
The rapid development of AI (artificial intelligence) has opened up new ethical frontiers at a startling pace. As the impact of AI is so ...
New
tonyxrandall
According to the latest data, the number of AI-related jobs has increased by a gigantic 119% since 2015. What’s more, as more companies i...
New
tonyxrandall
Artificial intelligence is a booming industry. This is a great chance to find just the right job for you. The tips and ideas below will h...
#ai
New
RobertRichards
In the early days, online gaming was limited to the screens of smartphones, PCs, tablets, and other devices. However, with the advent of ...
New
masterhood13
I just published an article detailing my journey in building a Dota 2 Match Outcome Predictor using machine learning and data analysis. I...
New
ozornin
I recently wrote an essay on effects of AI, how it all can become real bad, and how we can avoid it. I started writing the post intend...
New
John-BoothIQ
TL;DR: Good: AI is great at Elixir. It gets better as your codebase grows. Bad: It defaults to defensive, imperative code. You need...
New
aralroca
I wrote about a different approach to AI agents. Instead of building bots that screenshot your screen and click buttons (like OpenAI Oper...
New
xiji2646-netizen
I have been evaluating the three major AI video generation APIs for a project and figured this might save others some research time. Curi...
New
xiji2646-netizen
Anthropic announced Claude Mythos Preview this week – and then said it will not release it to the public. Their reasoning: the model’s cy...
New

Other popular topics Top

AstonJ
If it’s a mechanical keyboard, which switches do you have? Would you recommend it? Why? What will your next keyboard be? Pics always w...
New
AstonJ
You might be thinking we should just ask who’s not using VSCode :joy: however there are some new additions in the space that might give V...
New
New
AstonJ
This looks like a stunning keycap set :orange_heart: A LEGENDARY KEYBOARD LIVES ON When you bought an Apple Macintosh computer in the e...
New
AstonJ
In case anyone else is wondering why Ruby 3 doesn’t show when you do asdf list-all ruby :man_facepalming: do this first: asdf plugin-upd...
New
Margaret
Hello everyone! This thread is to tell you about what authors from The Pragmatic Bookshelf are writing on Medium.
1147 29994 760
New
PragmaticBookshelf
Create efficient, elegant software tests in pytest, Python's most powerful testing framework. Brian Okken @brianokken Edited by Kat...
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
AstonJ
If you’re getting errors like this: psql: error: connection to server on socket “/tmp/.s.PGSQL.5432” failed: No such file or directory ...
New
PragmaticBookshelf
A concise guide to MySQL 9 database administration, covering fundamental concepts, techniques, and best practices. Neil Smyth MySQL...
New