CommunityNews

Self-Retrieval: Building an information retrieval system with one LLM

Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
The rise of large language models (LLMs) has transformed the role of information retrieval (IR) systems in the way to humans accessing information. Due to the isolated architecture and the limited interaction, existing IR systems are unable to fully accommodate the shift from directly providing information to humans to indirectly serving large language models. In this paper, we propose Self-Retrieval, an end-to-end, LLM-driven information retrieval architecture that can fully internalize the required abilities of IR systems into a single LLM and deeply leverage the capabilities of LLMs during IR process. Specifically, Self-retrieval internalizes the corpus to retrieve into a LLM via a natural language indexing architecture. Then the entire retrieval process is redefined as a procedure of document generation and self-assessment, which can be end-to-end executed using a single large language model. Experimental results demonstrate that Self-Retrieval not only significantly outperforms previous retrieval approaches by a large margin, but also can significantly boost the performance of LLM-driven downstream applications like retrieval augumented generation.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.

View thread on forum

#llm

1 389 0

2024-03-09 18:39:37 UTC

Where Next?

View thread on forum

llm

Home General Dev>In The News

#llm

1 389 0

Last post

Popular General Dev topics

General Dev>In The News

Fuzix: A Unix-ish operating system for small machines by Alan Cox

FUZIX FUZIX is a fusion of various elements from the assorted UZI forks and branches beaten together into some kind of semi-coherent pla...

fuzix.org

#unix

0 1751 0

2021-01-04 22:15:21 UTC

New

General Dev>In The News

Helix, a Kakoune inspired Vim-model text editor (written in Rust)

Yet another rust-made text editor, though I’m really liking the looks of how this one works!

/rust

5 2417 1

2022-03-30 14:44:03 UTC

New

General Dev>In The News

8 Reasons to ditch Chrome and use Firefox

8 reasons to ditch Chrome and switch to Firefox. Chrome may dominate, but Firefox is a known name among browsers for a reason. Whether y...

pcworld.com

#chrome #firefox

73 1728 41

2022-07-14 16:50:27 UTC

New

General Dev>In The News

How to design a good API and why it matters (2006)

ABSTRACT In lieu of a traditional , I’ve tried to distill the essence of the talk into a collection of maxims: All programmers are API ...

dl.acm.org

#api #design

2 1250 1

2022-10-07 10:11:24 UTC

New

General Dev>In The News

Rust has been forked to the Crab Language

GitHub - crablang/crab: A community fork of a language named after a plant fungus. All of the memory-safe features you love, now with 100...

github.com

/rust

0 1222 1

2023-06-06 23:55:03 UTC

New

General Dev>In The News

Why Python is terrible

Why Python is terrible… Nice language, but unsuitable for most professional purposes

josvisser.substack.com

/python

8 1174 6

2024-04-06 04:17:41 UTC

New

General Dev>In The News

X can’t stop spread of explicit, fake AI Taylor Swift images

Will Swifties’ war on AI fakes spark a deepfake porn reckoning?

arstechnica.com

/swift

0 7404 0

2024-01-26 05:47:12 UTC

New

General Dev>In The News

To avoid being replaced by LLMs, do what they can't

To avoid being replaced by LLMs, do what they can’t. What LLM’s can’t do yet

seangoedecke.com

18 651 7

2025-04-14 19:56:29 UTC

New

General Dev>In The News

olmOCR – Open-Source OCR for Accurate Document Conversion

olmOCR is an open-source tool for converting PDFs to text with high accuracy, preserving reading order and supporting tables, equations, ...

olmocr.allenai.org

2 737 1

2025-03-09 05:08:33 UTC

New

General Dev>In The News

Llama.cpp AI Performance with the GeForce RTX 5090 Review

In beginning the NVIDIA Blackwell Linux testing with the GeForce RTX 5090 compute performance, besides all the CUDA/OpenCL/OptiX benchmar...

phoronix.com

#performance #cpp #llama #geforce

0 911 1

2025-03-21 12:10:45 UTC

New

Other popular topics

General Dev>Dev Chat

What are you listening to?

A thread that every forum needs! Simply post a link to a track on YouTube (or SoundCloud or Vimeo amongst others!) on a separate line an...

#community #music

202 4935 102

2025-07-26 22:00:31 UTC

New

Backend>Chat

Would you use Erlang now when there is Elixir?

Why, if your answer is yes?

/elixir /erlang

167 4700 52

2021-04-22 18:15:44 UTC

New

Linux>Chat

RancherOS is in end of life

Oh just spent so much time on this to discover now that RancherOS is in end of life but Rancher is refusing to mark the Github repo as su...

#linux #rancheros

10 5767 6

2021-01-30 21:04:03 UTC

New

General Dev>Dev Chat

The V Programming Language

The V Programming Language Simple language for building maintainable programs V is already mentioned couple of times in the forum, but I...

#programminguages /v

21 12589 7

2021-04-12 15:13:42 UTC

New

Backend>Learning Resources

Python Testing with pytest, Second Edition

Create efficient, elegant software tests in pytest, Python's most powerful testing framework. Brian Okken @brianokken Edited by Kat...

pragprog.com

#pragprog /python #published-book /book-python-testing-with-pytest-second-edition

16 5119 4

2021-06-25 16:57:39 UTC

New

General Dev>Hardware

CharaChorder - type at the speed of thought?

Saw this on TikTok of all places! :lol: Anyone heard of them before? Lite:

/keyboards #charachorder

13 4253 4

2021-10-07 21:33:25 UTC

New

macOS>Chat

How to block any website on Mac using Little Snitch

If you want a quick and easy way to block any website on your Mac using Little Snitch simply… File > New Rule: And select Deny, O...

#macos #how-to #littlesnitch

5 9782 3

2022-07-05 00:59:40 UTC

New

Android>Questions

Clipboard readtext not working in android webview

Inside our android webview app, we are trying to paste the copied content from another app eg (notes) using navigator.clipboard.readtext ...

#android #clipboard

1 4678 0

2022-09-27 18:52:03 UTC

New

AI>In The News

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...

#ai #macs /deepseek

0 5576 1

2025-01-29 18:43:37 UTC

New

Community>In The Spotlight

AMA with: Mark Volkmann (codebar Winter Lit Fest)

Ask Me Anything with Mark Volkmann @mvolkmann On February 24 and 25, we are giving you a chance to ask questions of PragProg author M...

/book-server-driven-web-apps-with-htmx #codebar-spotlight

37 1880 20

2025-02-26 21:39:39 UTC

New

General Dev>In The News

Petition to recognise open source work as civic service in Germany

General Dev>In The News

Swedish publishers file police report against Meta's Zuckerberg for fraud

General Dev>In The News

Last Issue of "ECMAScript News"

General Dev>In The News

Cloudflare outage should not have happened, and they seem to be missing the point on how to avoid it in the future

General Dev>In The News

Chat Control: EU lawmakers finally agree on the voluntary scanning of your private chats

General Dev>In The News

Bad UX World Cup

General Dev>In The News

A New Bridge Links the Strange Math of Infinity to Computer Science

General Dev>In The News

You can see a Quantum Computer in IBM’s London office

General Dev>In The News

GrapheneOS migrates server infrastructure from France amid police intimidation claims

General Dev>In The News

Pebble Watch Software Is Now 100% Open Source + Tick Talk #4 - PT2 Demos!

General Dev>In The News

General Dev In The News ❯

Latest on Devtalk

A trillion dollars is a terrible thing to waste

AI>In The News

Petition to recognise open source work as civic service in Germany

General Dev>In The News

Swedish publishers file police report against Meta's Zuckerberg for fraud

General Dev>In The News

Advent of Code 2025: A Kotlin Playground

Backend>Official News

PostgreSQL: Pgpool-II 4.7 beta1 is now released

Backend>Official News

Django: 2026 DSF Board Election Results

Backend>Official News

The chip made for the AI inference era – the Google TPU

AI>In The News

Same-day upstream Linux support for Snapdragon 8 Elite Gen 5 mobile platform

Linux>In The News

The Input Stack on Linux

Linux>In The News

AI CEO – Replace your boss before they replace you

AI>In The News

Elixir v1.19.4 released!

Backend>Official News

Fara-7B: An Efficient Agentic Model for Computer Use

AI>In The News

Last Issue of "ECMAScript News"

General Dev>In The News

Linux Kernel Explorer

Linux>In The News

The Current State of the Theory that GPL Propagates to AI Models Trained on GPL Code

AI>In The News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Self-Retrieval: Building an information retrieval system with one LLM

CommunityNews

Self-Retrieval: Building an information retrieval system with one LLM

Where Next?

Popular General Dev topics

Fuzix: A Unix-ish operating system for small machines by Alan Cox

Helix, a Kakoune inspired Vim-model text editor (written in Rust)

8 Reasons to ditch Chrome and use Firefox

How to design a good API and why it matters (2006)

Rust has been forked to the Crab Language

Why Python is terrible

X can’t stop spread of explicit, fake AI Taylor Swift images

To avoid being replaced by LLMs, do what they can't

olmOCR – Open-Source OCR for Accurate Document Conversion

Llama.cpp AI Performance with the GeForce RTX 5090 Review

Other popular topics

What are you listening to?

Would you use Erlang now when there is Elixir?

RancherOS is in end of life

The V Programming Language

Python Testing with pytest, Second Edition

CharaChorder - type at the speed of thought?

How to block any website on Mac using Little Snitch

Clipboard readtext not working in android webview

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

AMA with: Mark Volkmann (codebar Winter Lit Fest)

Sponsor Spotlight

General Dev>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta