Fl4m3Ph03n1x

Ecto multiple streams in 1 transaction

Background

PS: the following situation describes an hypothetical scenario, where I own a company that sells things to customers.

I have an Ecto query that is so big, that my machine cannot handle it. With billions of results returned, there is probably not enough RAM in the world that can handle it.

The solution here (or so my research indicates) is to use streams. Streams were made for potentially infinite sets of results, which would fit my use case.

https://hexdocs.pm/ecto/Ecto.Repo.html#c:stream/2

Problem

So lets imagine that I want to delete All users that bought a given item. Maybe that item was not really legal in their country, and now me, the poor guy in IT, has to fix things so the world doesn’t come down crashing.

Naive way:

item_id = "123asdasd123"

purchase_ids =
      Purchases
      |> where([p], p.item_id == ^item_id)
      |> select([p], p.id)
      |> Repo.all()

Users
    |> where([u], u.purchase_id in ^purchase_ids)
    |> Repo.delete_all()

This is the naive way. I call it naive, because of 2 issues:

We have so many purchases, that the machine’s memory will overflow (looking at purchase_ids query)
purchase_ids will likely have more than 100K ids, so the second query (where we delete things) will fail as it hits Postgres parameters limit of 32K: https://stackoverflow.com/a/42251312/1337392

What can I say, our product is highly addictive and very well priced!
Our customers simply cant get enough of it. Don’t know why. Nope. No reason comes to mind. None at all.

With these problems in mind, I cannot help my customers and grow my empire, I mean, little home owned business.

I did find this possible solution:

Stream way:

item_id = "123asdasd123"

purchase_ids =
      Purchases
      |> where([p], p.item_id == ^item_id)
      |> select([p], p.id)

stream = Repo.stream(purchase_ids)

Repo.transacion(fn -> 
  ids = Enum.to_list(stream)

  Users
    |> where([u], u.purchase_id in ^ids)
    |> Repo.delete_all()
end)

Questions

However, I am not convinced this will work:

I am using Enum.to_list and saving everything into a variable, placing everything into memory again. So I am not gaining any advantage by using Repo.stream.
I still have too many ids for my Repo.delete_all to work without blowing up

I guess the one advantage here is that this now a transaction, so either everything goes or nothing goes.

So, the following questions arise:

How do I properly make use of streams in this scenario?
Can I delete items by streaming parameters (ids) or do I have to manually batch them?
Can I stream ids to Repo.delete_all ?

2 comments

/elixir #backend

5 1448 2

2022-08-05 11:22:22 UTC

Marked As Solved

Fl4m3Ph03n1x

Every question post created here creates an entry in a dedicated thread in the official forum iirc. Nonetheless, I still post my questions in both places. And when I find an answer, I add it to both places as well.

I do this mainly for visibility, both for the community, and for the question itself, although the later one is less impactful due to the mentioned DevChat thread the official forum has.

Solutions

In regards to the question, there are two possible solutions.

One suggested by benwilson:

query = from u in Users,
  join: p in assoc(u, :purchase),
  where: p.item_id == ^item_id

Repo.delete_all(query)

And the other by Aleksei Matiushkin:

Repo.transacion(fn ->
  max_rows = 500

  purchase_ids
  |> Repo.stream(max_rows: max_rows)
  |> Stream.chunk_every(max_rows)
  |> Stream.each(fn ids ->
     Users
     |> where([u], u.purchase_id in ^ids)
     |> Repo.delete_all()
  end)
  |> Stream.run()
end, timeout: :infinity)

My pick

The first solution is great, but it requires the User Schema to have a belongs_to :purchase, Purchase definition in its schema. Unfortunately for me, this was a deal breaker, since changing any schemas in the project where I am working in is either not allowed or would result in a lengthy approval process.

So I went with the second solution that is self contained. It requires no changes to any schemas and it can work with the data as is.

Post #4

Also Liked

jaeyson

hi @Fl4m3Ph03n1x, this might be spammy but, have you tried to ask this via ElixirForums or slack? so other people can see this.

Post #3

Where Next?

View thread on forum

elixir

backend

Home Backend>Questions

/elixir #backend

5 1448 2

Last post

Popular Backend topics

Backend>Questions

Can I add Discourse as a forum feature to any website no matter its backend language?

Hello, Please, let’s say I have a website with user authentication made with Elixir/Phoenix, and now want to add a forum to it (using a ...

#forum #discourse

11 925 3

2020-04-12 06:28:46 UTC

New

Backend>Questions

I have a large SQL database with millions of records, and I’ve identified duplicate entries. What’s the most efficient way to find and re...

#question

1 546 0

2023-11-08 11:32:26 UTC

New

Backend>Questions

What is the difference between using `:references` and `:belongs_to` in a generate command in Rails?

What is the difference between using :references and :belongs_to in the following command? bin/rails generate scaffold LineItem product:...

/ruby /rails

5 660 2

2024-06-13 12:52:25 UTC

New

Backend>Questions

Connection backend to frontend for orders

Hi guys!! I´m studying and got a Full stack course but the course lacked a lot of support and and info to learn as it´s a course after wo...

#backend-development

2 302 1

2024-08-12 06:26:02 UTC

New

Backend>Questions

How to run Ollama deepseek-coder:6.7b-instruct-q4_K_M in Docker for CrewAI Agents?

Hi everyone, I’m trying to run deepseek-coder:6.7b-instruct-q4_K_M in Docker using Ollama to create an LLM that will be used by CrewAI a...

Design and develop sophisticated 2D games that are as much fun to make as they are to play. From particle effects and pathfinding to soci...

pragprog.com

#pragprog #ios #game-dev #macos /swift #published-book #apple /book-apple-game-frameworks-and-technologies

30 6234 10

2021-04-22 16:51:02 UTC

New

General Dev>Hardware

Seen any cool new keyboards?

We have a thread about the keyboards we have, but what about nice keyboards we come across that we want? If you have seen any that look n...

/keyboards #mechanical-keyboards

49 5587 39

2025-05-10 22:54:44 UTC

New

General Dev>Hardware

Planck vs Preonic vs Subatomic (Keyboards)

I ended up cancelling my Moonlander order as I think it’s just going to be a bit too bulky for me. I think the Planck and the Preonic (o...

/keyboards #mechanical-keyboards #ortholinear #planck #preonic

105 16166 47

2021-05-28 21:32:35 UTC

New

macOS>Chat

My thoughts on macOS vs Linux

Small essay with thoughts on macOS vs. Linux: I know @Exadra37 is just waiting around the corner to scream at me “I TOLD YOU SO!!!” but I...

#macos #linux

166 8678 69

2021-04-10 22:36:29 UTC

New

General Dev>Sales

Get 50% off these PragProg books during our Think Again Sale

Think Again 50% Off Sale » The theme of this sale is new perspectives on familiar topics. Enter coupon code ThinkAgain2021 at checkout t...

#community #pragprog /elixir /book-a-common-sense-guide-to-data-structures-and-algorithms-second-edition /book-programming-machine-learning /book-the-ray-tracer-challenge /book-forge-your-future-with-open-source /book-software-design-x-rays #algorithms /book-testing-elixir #machine-learning #sale /book-concurrent-data-processing-in-elixir /book-intuitive-python

8 6210 2

2021-05-11 23:06:15 UTC

New

General Dev>Dev Chat

Roc Language - a new purely functional programming language built for speed and ergonomics

Hi folks, I don’t know if I saw this here but, here’s a new programming language, called Roc Reminds me a bit of Elm and thus Haskell. ...

#programminguages #functional-programming

49 4745 14

2021-11-10 20:03:09 UTC

New

General Dev>In The News

X can’t stop spread of explicit, fake AI Taylor Swift images

Will Swifties’ war on AI fakes spark a deepfake porn reckoning?

arstechnica.com

/swift

0 7404 0

2024-01-26 05:47:12 UTC

New

AI>In The News

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...

#ai #macs /deepseek

0 5576 1

2025-01-29 18:43:37 UTC

New

Latest in Elixir

Thinking Elixir 280 - Dark Matter Developers

Backend>Blogs/Talks

Ash Framework: Diving into validation

Backend>Blogs/Talks

Thinking Elixir 279 - Hot Code Upgrades and Hotter AI Takes

Backend>Blogs/Talks

Choosing Phoenix LiveView - The difficulties deciding between Phoenix LiveView and traditional frontend frameworks

Backend>Blogs/Talks

Ash Framework. Understanding actions from a functional programming perspective

Backend>Blogs/Talks

Thinking Elixir 278 - WAL-ing Through Database Changes

Backend>Blogs/Talks

What it’s like to bring Ash into an existing project?

Backend>Blogs/Talks

Thinking Elixir 276 - Elixir v1.19 Types and Speed

Backend>Blogs/Talks

Exploring how Ash works with Ecto and taking the opportunity to try a promising new library called Electric

Backend>Blogs/Talks

Taking a closer look at messy authorization

Backend>Blogs/Talks

Elixir Portal ❯

Backend>Questions

What do you think is a good direction to go for someone with a Rails background?

Backend>Questions

Anyone know how to get into Go from an Elixir background?

Backend>Questions

Are there any text-to-speech ai tools available using elixir?

Backend>Questions

How to run Ollama deepseek-coder:6.7b-instruct-q4_K_M in Docker for CrewAI Agents?

Backend>Questions

Psql: error: connection to server on socket "/tmp/.s.PGSQL.5432" failed: No such file or directory

Backend>Questions

Connection backend to frontend for orders

Backend>Questions

Clarifications with the terms regarding augmenting AI in your code

Backend>Questions

What is the difference between using `:references` and `:belongs_to` in a generate command in Rails?

Backend>Questions

Dialyzer cannot recognize types from dependencies

Backend>Questions

Learning Elixir Phoenix and Ash

Backend>Questions

Backend Questions ❯

Latest on Devtalk

Launching the Julia Security Working Group

Backend>Official News

You can see a Quantum Computer in IBM’s London office

General Dev>In The News

Three Years from GPT-3 to Gemini 3

AI>In The News

Node.js v20.19.6 released!

Backend>Official News

Thinking Elixir 280 - Dark Matter Developers

Backend>Blogs/Talks

PostgreSQL: powa-archivist and powa-web 5.1.0 are out!

Backend>Official News

Rust: Interview with Jan David Nose

Backend>Official News

Mind-reading devices can now predict preconscious thoughts: is it time to worry?

AI>In The News

GrapheneOS migrates server infrastructure from France amid police intimidation claims

General Dev>In The News

Resources to learn how to compute HMM transition matrix?

AI>Questions

Pebble Watch Software Is Now 100% Open Source + Tick Talk #4 - PT2 Demos!

General Dev>In The News

Introducing advanced tool use on the Claude Developer Platform

AI>In The News

An entire PS5 now costs less than 64GB of DDR5 memory

Game Dev>In The News

V weekly.2025.48 released!

Backend>Official News

Ash v3.10.0 released!

Backend>Official News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Ecto multiple streams in 1 transaction

Fl4m3Ph03n1x

Ecto multiple streams in 1 transaction

Background

Problem

Questions

Marked As Solved

Fl4m3Ph03n1x

Solutions

My pick

Also Liked

jaeyson

Where Next?

Popular Backend topics

Can I add Discourse as a forum feature to any website no matter its backend language?

Not fully understanding the code in the example in "Learn to Program"

Proxying website use backend

Pytest: test fails while getting URL

Elixir Witchcraft IO monad?

(ArgumentError) unknown application: :bakeware

How to Efficiently Find and Remove Duplicates in a Large SQL Database?

What is the difference between using `:references` and `:belongs_to` in a generate command in Rails?

Connection backend to frontend for orders

How to run Ollama deepseek-coder:6.7b-instruct-q4_K_M in Docker for CrewAI Agents?

Other popular topics

How do you keep fit and healthy?

What are you watching?

Apple Game Frameworks and Technologies

Seen any cool new keyboards?

Planck vs Preonic vs Subatomic (Keyboards)

My thoughts on macOS vs Linux

Get 50% off these PragProg books during our Think Again Sale

Roc Language - a new purely functional programming language built for speed and ergonomics

X can’t stop spread of explicit, fake AI Taylor Swift images

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

Sponsor Spotlight

Latest in Elixir

Backend>Questions

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta