Similar to r_31415's answer, but without having to name any other field than the field that we want to modify or remembering the order of the fields (also similar to roaima's answer in that respect):
csvsql --query 'UPDATE stdin SET "Second Header" = substr("Second Header",1,2); SELECT * FROM stdin' <file
A slightly more flexible approach which allows us to slot in any JSON-capable tool to do the needed transformations on the data:
- Using
csvjson
from csvkit to create a JSON array of the CSV data.
- Using
jq
to read the JSON data, and to truncate the Second Header
values at two characters.
- Using
in2csv
from csvkit to read the JSON and to create CSV.
csvjson file | jq 'map(."Second Header" |= .[:2])' | in2csv -f json
The output of csvjson
, given the CSV data in the question, is the equivalent of
[
{
"First Header": "foo",
"Second Header": null
},
{
"First Header": "foo",
"Second Header": "b"
},
{
"First Header": "foo",
"Second Header": "bar"
},
{
"First Header": "foo",
"Second Header": "bar"
},
{
"First Header": "foo",
"Second Header": "\"bar"
},
{
"First Header": "foo",
"Second Header": "\nbar"
}
]
The output of jq
would then be
[
{
"First Header": "foo",
"Second Header": null
},
{
"First Header": "foo",
"Second Header": "b"
},
{
"First Header": "foo",
"Second Header": "ba"
},
{
"First Header": "foo",
"Second Header": "ba"
},
{
"First Header": "foo",
"Second Header": "\"b"
},
{
"First Header": "foo",
"Second Header": "\nb"
}
]
And the final output:
First Header,Second Header
foo,
foo,b
foo,ba
foo,ba
foo,"""b"
foo,"
b"