Similar to Improving performance with a common low-cardinality field
We get a large dataset and we load it by source. Let's say team_id. We currently have our data partitioned by team_id and then by the timestamps of events. However, this does mean the team_id ends up being repeated over each monthly (for e.g.) partition for that team. Is there a way to save space both in the tables and in any indexes we use that would include team_id?