Consider a contrived example using a JSON object such as this, where I want to extract the id, firstname and lastname fields from each of the (many) array elements into shell variables for further (non-JSON) processing.
{
  "customers": [
    {
      "id": 1234,
      "firstname": "John",
      "lastname": "Smith",
      "other": "fields",
      "are": "present",
      "here": "etc."
    },
    {
      "id": 2468,
      "firstname": "Janet",
      "lastname": "Green",
      "other": "values",
      "are": "probably",
      "here": "maybe"
    }
  ]
}
For simple data I can use this,
jq -r '.customers[] | ((.id|tostring) + " " + .firstname + " " + .lastname)' <data.json |
while IFS=' ' read -r id firstname lastname
do
# More processing, but omitted for the example
printf '%s -- %s -- %s\n' "$id" "$firstname" "$lastname"
done
Output
1234 -- John -- Smith
2468 -- Janet -- Green
but of course this will fail with double-barrelled firstname values such as Anne Marie. Changing the separator to another character such as # feels more like a fudge than a solution but could be acceptable.
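One way to make that separator change feel less like a fudge (a sketch of the same per-line approach, not a full solution) is to use jq's @tsv filter and split on tabs, which are far less likely to appear inside a name than spaces:

```shell
# Emit id, firstname and lastname as tab-separated fields; @tsv
# stringifies numbers and escapes any embedded tabs in the values.
jq -r '.customers[] | [.id, .firstname, .lastname] | @tsv' <data.json |
while IFS=$'\t' read -r id firstname lastname
do
    # More processing, but omitted for the example
    printf '%s -- %s -- %s\n' "$id" "$firstname" "$lastname"
done
```

This keeps "Anne Marie" intact in $firstname, though it still shares the general weakness of any single-character separator scheme.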
For more complex situations I might pick out the list of id values and then trade speed for accuracy by going back to extract the corresponding firstname and lastname elements. Something like this:
jq -r '.customers[].id' <data.json |
while IFS= read -r id
do
block=$(jq -r --argjson id "$id" '.customers[] | select(.id == $id)' <data.json)
firstname=$(jq -r '.firstname' <<<"$block")
lastname=$(jq -r '.lastname' <<<"$block")
# More processing, but omitted for the example
printf '%s -- %s -- %s\n' "$id" "$firstname" "$lastname"
done
Output
1234 -- John -- Smith
2468 -- Janet -- Green
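The three jq invocations per id could be collapsed into one in the same spirit (a sketch, still requiring one pass over data.json per id; --argjson keeps $id numeric so the == comparison matches):

```shell
jq -r '.customers[].id' <data.json |
while IFS= read -r id
do
    # Single jq call per id: select the matching object and emit both
    # name fields tab-separated, then split them into shell variables.
    IFS=$'\t' read -r firstname lastname < <(
        jq -r --argjson id "$id" \
           '.customers[] | select(.id == $id) | [.firstname, .lastname] | @tsv' <data.json)
    # More processing, but omitted for the example
    printf '%s -- %s -- %s\n' "$id" "$firstname" "$lastname"
done
```

The process substitution `< <(...)` is bash-specific; a POSIX shell would need a here-document or pipeline instead.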
However, neither of these is both correct and efficient. While I won't be running the real code at high frequency, I'd like to know whether there is a more appropriate way of getting multiple data elements safely and efficiently out of a JSON object structure and into shell variables.