This Week's Trending
Can LLMs control robots? We answer this by testing how good models are at passing the butter – or more generally, doing delivery tasks in a ...
This Month's Trending
LLM inference that gets faster as you use it. Our runtime-learning accelerator adapts continuously to your workload, delivering 500 TPS o...
This Year's Trending
Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language...
Comparison and ranking of the performance of over 100 AI models (LLMs) across key metrics including intelligence, price, performance and spe...
But for what I do use LLMs for, it’s invaluable.
Reverse Engineering Cursor’s LLM Client
By some appearances, at least, the kernel community has been relatively insulated from the onsl […]
One common practice for working with MCP tool calls is to put the outputs from a tool back into the LLM as a message, and ask the LLM fo...
CocoIndex now supports knowledge graphs with incremental processing. Building live knowledge for agents is super easy with CocoIndex!
A misconfiguration that might have cost us $7,000
In this technical report, we tackle the challenges of training large-scale Mixture of Experts (MoE) models, focusing on overcoming cost i...
Confused by LLMs, RAG, & AI Agents? We break down the spectrum of AI system design with a familiar resume-screening example to show t...
Do we have to disclaim how we use AI?
A practical handbook for engineers building, optimizing, scaling and operating LLM inference systems in production.
llms.txt is an emerging standard for making content such as docs available for direct consumption by AIs. We’re proposing a convention to...
Large Language Models (LLMs) have revolutionized natural language processing, but their varying capabilities and costs pose challenges in...
GitHub - NVIDIA/garak: the LLM vulnerability scanner.
the LLM vulnerability scanner. Contribute to NVIDIA/garak development by creating ...
Last Three Years' Trending
Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
The rise of large language models (LLMs) has tra...
GitHub - intel-analytics/ipex-llm: Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma...
GitHub - rasbt/LLMs-from-scratch: Implementing a ChatGPT-like LLM from scratch, step by step.
Implementing a ChatGPT-like LLM from scrat...
GitHub - google/maxtext: A simple, performant and scalable Jax LLM!
A simple, performant and scalable Jax LLM! Contribute to google/max...
Code LoRA from Scratch - a Lightning Studio by sebastian.
LoRA (Low-Rank Adaptation) is a popular technique to finetune LLMs more effici...
Home | ArtificialAnalysis.ai.
Analysis of AI models and hosting providers - choose the best model and provider for your use case
Hello from Scrapegraph-ai | Scrapegraph-ai.
Official documentation of Scrapegraph-ai
Kindllm - LLM chat for Kindle.
The distraction-free LLM chat app for Kindle
AMD’s MI300X Outperforms NVIDIA’s H100 for LLM Inference.
Discover if AMD’s MI300X accelerator can outperform NVIDIA’s H100 in real-worl...
Hello Qwen2.
Introduction After months of efforts, we are pleased to announce the evolution...
GitHub - google-deepmind/recurrentgemma: Open weights language model from Google DeepMind, based on Griffin…
Open weights language model...
Jamba: A Hybrid Transformer-Mamba Language Model.
We present Jamba, a new base large language model based on a novel hybrid Transformer-...
Use Prolog to improve LLM’s reasoning.
On one side, LLMs show unseen capabilities in reasoning, but on the other - reasoning in LLMs is ...
Get consistent data from your LLM with JSON Schema.
How to parse content from a tool that is made to speak in human sentences.
Building an early warning system for LLM-aided biological threat creation.
We’re developing a blueprint for evaluating the risk that a l...
Trending Over Three Years
Forest Friends Zine.
A guide for AI Engineers building the wild world of LLM system evals
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples.
We analyze how well pre...
What would an LLM OS look like?
Andrej Karpathy’s YouTube channel is fantastic. He just published an Intro to Large Language Models vide...
Can GPT Optimize My Taxes?
TL;DR Yep.
LLM inference speed of light.
In the process of working on calm, a minimal from-scratch fast CUDA implementation of transformer-based la...
GitHub - apple/ml-mgie.
Contribute to apple/ml-mgie development by creating an account on GitHub.
Aya.
Cohere’s non-profit research lab, C4AI, released the Aya model, a state-of-the-art, open source, massively multilingual, research L...
Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large ...
DRINK ME: (Ab)Using a LLM to compress text.
Large language models are trained on huge datasets of text to learn the relat...
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training.
In this work, we discuss building performant Multimodal Large La...
Wikipedia Citation Needed.
A chrome extension for finding citations in Wikipedia by using ChatGPT
If you’re underwhelmed with AI coding agents or simply want to get more out of them, give parallelization a try. After seeing the results...
Rethinking LLM Inference: Why Developer AI Needs a Different Approach.
A technical blog post from Augment Code explaining their approach...
How local, learnable routers can reduce token overhead, lower costs, and bring structure back to agentic workflows.
My experience creating software with LLM coding agents - Part 2.
This post details my experiences creating software with LLM coding agents...