Counting associated records with Elixir and Ecto

Two ways to count associated records with Elixir and Ecto

5 January 2024

I’ve often found myself wanting to know how many associated records exist for a given entity in a project’s database, so I’ve captured two ways of doing it with Ecto here. Specifically:

  1. Using multiple database transactions, aggregates and combining the results
  2. Using one database transaction with inner join laterals

Please note that these can be quite costly transactions depending on your real scenario, so try to minimise their usage in common scenarios/hot paths.

For the purpose of this example, assume we have a very simple database like below, and we want to know how many Products and Consumers exist for a given Vendor.

I’ve added 2 virtual fields to the Vendor schema for us to populate with the counts. I’ve omitted the changeset for brevity.

defmodule Insight.Vendors.Vendor do
  use Ecto.Schema

  @primary_key {:id, :binary_id, autogenerate: true}
  @foreign_key_type :binary_id

  schema "vendors" do
    field :name, :string
    field :consumer_count, :integer, virtual: true, default: 0
    field :product_count, :integer, virtual: true, default: 0
    has_many :consumers, Insight.Consumers.Consumer
    has_many :products, Insight.Products.Product
  end
end

This means our Vendor struct will look something like this by default, and we want to populate consumer_count and product_count whenever we retrieve a specific vendor.

%Insight.Vendors.Vendor{
  __meta__: #Ecto.Schema.Metadata<:loaded, "vendors">,
  id: "4cda6cf6-dd4f-4264-9e8e-62f24f18c666",
  name: "Nike",
  consumer_count: 0,
  product_count: 0,
  consumers: #Ecto.Association.NotLoaded<association :consumers is not loaded>,
  products: #Ecto.Association.NotLoaded<association :products is not loaded>
}

1. Using multiple database transactions, aggregates and combining the results

# lib/insight/vendors.ex -- Multiple DB transactions example
def get_vendor!(id) do
  Vendor
  |> Repo.get!(id)
  |> add_assoc_count(:consumers, :consumer_count)
  |> add_assoc_count(:products, :product_count)
end

defp add_assoc_count(struct, assoc, key) do
  count =
    struct
    |> Ecto.assoc(assoc)
    |> Repo.aggregate(:count, :id)

  Map.put(struct, key, count)
end

This is the cleanest and most digestible solution I’ve found, making good use of Elixir and Ecto’s composability. However it does result in 3 transactions with the database in our example.

Our private function add_assoc_count/3 makes use of Ecto.assoc/3 to build a query for the given association (:consumers and :products), Ecto.Repo.aggregate/3 to count the records, and adds the result into the Vendor struct under the appropriate key (:consumer_count and :product_count) with Map.put/3.

2. Using one database transaction with inner join laterals

# lib/insight/vendors.ex -- LATERAL JOIN example
def get_vendor!(id) do
  consumer_subquery =
    Consumer
    |> where([c], c.vendor_id == parent_as(:vendor).id)
    |> select([c], %{consumer_count: count(c.id)})

  product_subquery =
    Product
    |> where([p], p.vendor_id == parent_as(:vendor).id)
    |> select([p], %{product_count: count(p.id)})

  Vendor
  |> from(as: :vendor)
  |> where(id: ^id)
  |> join(:inner_lateral, [], subquery(consumer_subquery), on: true)
  |> join(:inner_lateral, [], subquery(product_subquery), on: true)
  |> select([v, c, p], %{v | consumer_count: c.consumer_count, product_count: p.product_count})
  |> Repo.one!()
end

This achieves the same outcome but gets a fully processed result back from the database in one transaction.

It makes use of Ecto.Query.join/5 with a qualifier type of :inner_lateral and multiple subqueries via Ecto.Query.subquery/2.

Each subquery performs the count calculation, and then the final select statement creates the desired response containing the entire vendor struct and the two calculated fields.

It is less readable than #1 but may be more performant for your use case.

Copyright © 2024 Matthew Stibbard