Latest #llm Threads 

Thinking Elixir 260: Cheaper testing with AI?
Episode 260 of Thinking Elixir. News includes LiveDebugger v0.3.0 with enhanced debugging ...
New

How local, learnable routers can reduce token overhead, lower costs, and bring structure back to agentic workflows.
New

I just came across this tool. It looks interesting.
Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platfor...
New

A new openSUSE blog post/announcement has been posted!
Get the full details here: SUSE Refines, Releases Open-Source LLM to Fuel Commun...
New

Confused by LLMs, RAG, & AI Agents? We break down the spectrum of AI system design with a familiar resume-screening example to show t...
New

: 6-in-10 success rate for single-step tasks
New

Reverse Engineering Cursor’s LLM Client
New

How the Elixir community can survive — and thrive — in an age of LLMs.
New

If you’re underwhelmed with AI coding agents or simply want to get more out of them, give parallelization a try. After seeing the results...
New

One common practice for working with MCP tools calls is to put the outputs from a tool back into the LLM as a message, and ask the LLM fo...
New
This Week's Trending

I just came across this tool. It looks interesting.
Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platfor...
New

Thinking Elixir 260: Cheaper testing with AI?
Episode 260 of Thinking Elixir. News includes LiveDebugger v0.3.0 with enhanced debugging ...
New

How local, learnable routers can reduce token overhead, lower costs, and bring structure back to agentic workflows.
New
This Month's Trending

Confused by LLMs, RAG, & AI Agents? We break down the spectrum of AI system design with a familiar resume-screening example to show t...
New

A new openSUSE blog post/announcement has been posted!
Get the full details here: SUSE Refines, Releases Open-Source LLM to Fuel Commun...
New

: 6-in-10 success rate for single-step tasks
New
This Year's Trending

Loads of news stories about DeepSeek here in the last few days, no surprise as it’s been making headlines across the world! Currently a h...
New

But for what I do use LLMs for, it’s invaluable.
New

How the Elixir community can survive — and thrive — in an age of LLMs.
New

Reverse Engineering Cursor’s LLM Client
New

Use Prolog to improve LLM’s reasoning.
On one side, LLMs show unseen capabilities in reasoning, but on the other - reasoning in LLMs is ...
New

A new Go blog post/announcement has been posted!
Get the full details here: Building LLM-powered applications in Go - The Go Programmin...
New

CocoIndex now supports knowledge graph with incremental processing. Build live knowledge for agents is super easy with CocoIndex!
New

In this technical report, we tackle the challenges of training large-scale Mixture of Experts (MoE) models, focusing on overcoming cost i...
New

A misconfiguration that might have cost us $7,000
New

One common practice for working with MCP tools calls is to put the outputs from a tool back into the LLM as a message, and ask the LLM fo...
New

If you’re underwhelmed with AI coding agents or simply want to get more out of them, give parallelization a try. After seeing the results...
New

My LLM codegen workflow atm.
A detailed walkthrough of my current workflow for using LLms to build software, from brainstorming through ...
New

GitHub - NVIDIA/garak: the LLM vulnerability scanner.
the LLM vulnerability scanner. Contribute to NVIDIA/garak development by creating ...
New

Offline Reinforcement Learning for LLM Multi-Step Reasoning.
Improving the multi-step reasoning ability of large language models (LLMs) ...
New

Forest Friends Zine.
A guide for AI Engineers building the wild world of LLM system evals
New
Last Three Year's Trending

Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
The rise of large language models (LLMs) has tra...
New

GitHub - intel-analytics/ipex-llm: Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma...
New

Code LoRA from Scratch - a Lightning Studio by sebastian.
LoRA (Low-Rank Adaptation) is a popular technique to finetune LLMs more effici...
New

GitHub - rasbt/LLMs-from-scratch: Implementing a ChatGPT-like LLM from scratch, step by step.
Implementing a ChatGPT-like LLM from scrat...
New

Jamba: A Hybrid Transformer-Mamba Language Model.
We present Jamba, a new base large language model based on a novel hybrid Transformer-...
New

Hello from Scrapegraph-ai | Scrapegraph-ai.
Official documentation of Scrapegraph-ai
New

Home | ArtificialAnalysis.ai.
Analysis of AI models and hosting providers - choose the best model and provider for your use case
New

Episode 185 of Thinking Elixir. Dive into the world of structured LLM prompting with our latest guest who shares insights on their innova...
New

Kindllm - LLM chat for Kindle.
The distraction-free LLM chat app for Kindle
New

AMD’s MI300X Outperforms NVIDIA’s H100 for LLM Inference.
Discover if AMD’s MI300X accelerator can outperform NVIDIA’s H100 in real-worl...
New

GitHub - google-deepmind/recurrentgemma: Open weights language model from Google DeepMind, based on Griffin…
Open weights language model...
New

Building an early warning system for LLM-aided biological threat creation.
We’re developing a blueprint for evaluating the risk that a l...
New

Get consistent data from your LLM with JSON Schema.
How to parse content from a tool that is made to speak in human sentences.
New

A big barrier to getting started with local AI development is access to hardware. And by “local”, I mean having direct access to a GPU an...
New

LLM inference speed of light.
In the process of working on calm, a minimal from-scratch fast CUDA implementation of transformer-based la...
New
Trending Over Three Years

What would an LLM OS look like?.
Andrej Karpathy’s YouTube channel is fantasic. He just published an Intro to Large Language Models vide...
New

GitHub - google/maxtext: A simple, performant and scalable Jax LLM!.
A simple, performant and scalable Jax LLM! Contribute to google/max...
New

Aya.
Cohere’s non-profit research lab, C4AI, released the Aya model, a state-of-the-art, open source, massively multilingual, research L...
New

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training.
In this work, we discuss building performant Multimodal Large La...
New

Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large ...
New

DRINK ME: (Ab)Using a LLM to compress text.
Introduction
Large language models are trained on huge datasets of text to learn the relat...
New

GitHub - apple/ml-mgie.
Contribute to apple/ml-mgie development by creating an account on GitHub.
New

Can GPT Optimize My Taxes?.
TL;DR Yep.
New

From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples.
We analyze how well pre...
New

Hello Qwen2.
GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD
Introduction After months of efforts, we are pleased to announce the evolution...
New

Alex Strick van Linschoten - How to think about creating a dataset for LLM finetuning evaluation.
I summarise the kinds of evaluations t...
New

New

Wikipedia Citation Needed.
A chrome extension for finding citations in Wikipedia by using ChatGPT
New

How to run an LLM locally on your PC in less than 10 minutes.
Cut through the hype, keep your data private, find out what all the fuss i...
New

Top 9 Libraries to Accelerate LLM Building.
The Open-source Tool Stack to build, scale, test, deploy, and monitor LLMs in 2024.
New
Get money off!

The Pragmatic Bookshelf
35% off any eBook

Manning Publications
45% off any item

The Pragmatic Studio
20% off any course
Simply use coupon code "devtalk.com" at checkout. Where applicable this coupon can be used for an many items and as many times as you like!
Filter by Type:
Popular Tags
- #apple
- #code
- #programming
- #linux
- #web
- #podcasts
- #blog-post
- #video
- #news
- #otp
- #community
- #chatgpt
- #macos
- #new
- #microsoft
- #learning
- #openai
- #github
- #development
- #database
- #design
- #performance
- #ios
- #project
- #testing
- #internet
- #css
- #apps
- #android
- #hardware
- #quantum
- #guide
- #nvidia
- #intel
- #amazon
- #browser
- #liveview
- #musk
- #manning
- #privacy
- #social
- #games
- #ai
- #writing
- #languages
- #windows
- #api
- #tiktok
Popular Portals
- /elixir
- /rust
- /ruby
- /wasm
- /erlang
- /phoenix
- /keyboards
- /rails
- /js
- /python
- /security
- /go
- /swift
- /vim
- /clojure
- /emacs
- /haskell
- /java
- /onivim
- /svelte
- /typescript
- /crystal
- /c-plus-plus
- /kotlin
- /tailwind
- /gleam
- /ocaml
- /react
- /elm
- /flutter
- /vscode
- /ash
- /opensuse
- /centos
- /php
- /deepseek
- /html
- /zig
- /scala
- /sublime-text
- /textmate
- /debian
- /nixos
- /lisp
- /agda
- /react-native
- /kubuntu
- /arch-linux
- /ubuntu
- /revery