aralroca

AI agents shouldn't control your apps; they should BE the app

I wrote about a different approach to AI agents. Instead of building bots that screenshot your screen and click buttons (like OpenAI Operator or Anthropic Computer Use), I built 304 browser tools where the AI agent IS the app: it calls local functions directly instead of navigating UIs.
Everything runs client-side: JavaScript for simple tasks, Rust compiled to WASM for heavy computation, and ONNX/TensorFlow.js models for AI features like background removal.
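
To make the "calling local functions directly" idea concrete, here is a rough sketch of what a tool registry and dispatcher could look like. It is not taken from the article: the names `tools`, `dispatch`, `wordCount`, and `slugify` are hypothetical, and the shape of a tool call (`name` plus `args`) is an assumption about how an agent might hand over its request.

```typescript
// Hypothetical sketch: every name here is illustrative, not from the article.
type ToolArgs = Record<string, unknown>;

interface Tool {
  description: string;              // text the agent reads when picking a tool
  run: (args: ToolArgs) => unknown; // plain local function, no UI involved
}

interface ToolCall {
  name: string;
  args: ToolArgs;
}

interface ToolResult {
  ok: boolean;
  value?: unknown;
  error?: string;
}

// Read a string argument defensively, since the agent's output is untrusted.
function textArg(args: ToolArgs): string {
  return typeof args.text === "string" ? args.text : "";
}

// Registry of local tools. In a real app some entries might wrap a WASM
// module or an ONNX model instead of plain JavaScript (assumption).
const tools = new Map<string, Tool>();

tools.set("wordCount", {
  description: "Count whitespace-separated words in `text`.",
  run: (args) => {
    const trimmed = textArg(args).trim();
    return trimmed === "" ? 0 : trimmed.split(/\s+/).length;
  },
});

tools.set("slugify", {
  description: "Turn `text` into a lowercase, hyphen-separated slug.",
  run: (args) =>
    textArg(args)
      .toLowerCase()
      .replace(/[^a-z0-9]+/g, "-") // collapse non-alphanumeric runs
      .replace(/^-+|-+$/g, ""),    // trim leading/trailing hyphens
});

// The agent emits a ToolCall; the app runs the matching function directly.
function dispatch(call: ToolCall): ToolResult {
  const tool = tools.get(call.name);
  if (!tool) {
    return { ok: false, error: "Unknown tool: " + call.name };
  }
  try {
    return { ok: true, value: tool.run(call.args) };
  } catch (err) {
    return { ok: false, error: String(err) };
  }
}

// Example: the model asks for a word count instead of clicking through a UI.
const result = dispatch({ name: "wordCount", args: { text: "agents call  functions" } });
console.log(result); // { ok: true, value: 3 }
```

The point of the sketch is the contrast with screenshot-driven agents: the model only has to choose a tool name and arguments, and an unknown name comes back as a structured error rather than a misplaced click.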

Full article:

https://kitmul.com/en/blog/building-kitmul-ai-tools-os

Curious what you all think: is the “AI controlling existing apps” approach fundamentally flawed, or am I missing something?
