hpr2752 :: XSV for fast CSV manipulations - Part 2
Part 2 of my introduction to the XSV tool
Hosted by Mr. Young on Tuesday, 2019-02-19. Flagged as Clean and released under a CC-BY-SA license.
xsv, csv, command-line, data.
The show is available on the Internet Archive at: https://archive.org/details/hpr2752
Duration: 00:22:39
general.
XSV for fast CSV manipulations - Part 2
Introduction
xsv is a command line program for indexing, slicing, analyzing, splitting and joining CSV files. Commands should be simple, fast and composable:
- Simple tasks should be easy.
- Performance trade-offs should be exposed in the CLI.
- Composition should not come at the expense of performance.
We will be using the CSV file provided in the documentation.
Commands covered in this episode
- fixedlengths - Force a CSV file to have same-length records by either padding or truncating them.
- fmt - Reformat CSV data with different delimiters, record terminators or quoting rules. (Supports ASCII delimited data.)
- input - Read CSV data with exotic quoting/escaping rules.
- partition - Partition CSV data based on a column value.
- split - Split one CSV file into many CSV files, each containing up to N records.
- sample - Randomly draw rows from CSV data using reservoir sampling (i.e., use memory proportional to the size of the sample).
- cat - Concatenate CSV files by row or by column.
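
The reservoir sampling mentioned for the sample command in the list above may be easier to picture with a small example. What follows is a minimal Python sketch of the general technique (often called Algorithm R), not xsv's actual implementation. The function name `reservoir_sample`, the fixed seed and the in-memory CSV are all made up for illustration. The point it shows is that only as many rows as the sample size are held in memory, however long the input is.

```python
import csv
import io
import random


def reservoir_sample(rows, k, rng=None):
    """Return up to k items drawn uniformly at random from an iterable.

    Only k items are held in memory at any time (Algorithm R).
    This is an illustrative sketch, not how xsv is implemented.
    """
    if rng is None:
        rng = random.Random()
    reservoir = []
    for i, row in enumerate(rows):
        if i < k:
            # Fill the reservoir with the first k rows.
            reservoir.append(row)
        else:
            # Keep the new row with probability k / (i + 1),
            # overwriting a randomly chosen row already kept.
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = row
    return reservoir


# Hypothetical in-memory CSV standing in for a large file on disk.
data = "city,population\n" + "\n".join(
    f"city{n},{n * 10}" for n in range(1000)
)
reader = csv.reader(io.StringIO(data))
header = next(reader)  # keep the header row out of the sample

# The seed is fixed only so that repeated runs give the same rows.
sample = reservoir_sample(reader, 5, random.Random(42))

writer_buffer = io.StringIO()
writer = csv.writer(writer_buffer)
writer.writerow(header)
writer.writerows(sample)
print(writer_buffer.getvalue())
```

Because the rows are consumed one at a time from the reader, the same function would work on a file too large to load in one go. If the input has fewer rows than the requested sample size, every row is returned.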