datamash
alternative syntax for --group
cross-tabulate two fields (also known as ‘pivot tables’)
transpose rows, columns of a text file
reverse fields in each line of a text file
verify tabular structure of input (ensure same number of fields in all lines)
remove lines with duplicated key value
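The table-level operations described above (transposing, reversing fields, verifying tabular structure, and removing lines with a duplicated key) can be illustrated with a minimal Python sketch. This is not datamash's implementation: the function names are hypothetical, rows are assumed to be already split into lists of fields, and keeping the first line seen for each key is an assumption of the sketch.

```python
def transpose(rows):
    """Swap rows and columns. Assumes the rows passed check_tabular() first,
    because zip() would silently truncate ragged input."""
    return [list(column) for column in zip(*rows)]


def reverse_fields(rows):
    """Reverse the order of the fields in every line."""
    return [row[::-1] for row in rows]


def check_tabular(rows):
    """Return (number of lines, number of fields); raise ValueError
    if any line has a different number of fields than the first line."""
    if not rows:
        return (0, 0)
    expected = len(rows[0])
    for line_number, row in enumerate(rows, start=1):
        if len(row) != expected:
            raise ValueError(
                f"line {line_number} has {len(row)} fields, expected {expected}"
            )
    return (len(rows), expected)


def remove_duplicates(rows, key_index):
    """Keep only the first line seen for each key value (assumption of this sketch)."""
    seen = set()
    kept = []
    for row in rows:
        key = row[key_index]
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


table = [["a", "b", "c"], ["1", "2", "3"]]
print(check_tabular(table))
print(transpose(table))
print(reverse_fields(table))
```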
encode the field as base64
decode the field as base64. Exits with an error if the field is not a valid base64 value.
calculates md5 hash of the field
calculates sha1 hash of the field
calculates sha224 hash of the field
calculates sha256 hash of the field
calculates sha384 hash of the field
calculates sha512 hash of the field
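As a rough illustration of the encoding and hashing operations above, the following Python sketch uses the standard `base64` and `hashlib` modules. The function names are hypothetical, the fields are assumed to be UTF-8 text, and returning the digest as a hexadecimal string is an assumption rather than something the descriptions above state.

```python
import base64
import binascii
import hashlib


def encode_base64(field):
    """Encode a text field as base64."""
    return base64.b64encode(field.encode("utf-8")).decode("ascii")


def decode_base64(field):
    """Decode a base64 field; raise ValueError if it is not valid base64."""
    try:
        raw = base64.b64decode(field, validate=True)
    except binascii.Error as exc:
        raise ValueError(f"invalid base64 value: {field!r}") from exc
    return raw.decode("utf-8")


def hash_field(field, algorithm):
    """Hex digest of a field; algorithm is e.g. 'md5', 'sha1' or 'sha256'."""
    return hashlib.new(algorithm, field.encode("utf-8")).hexdigest()


encoded = encode_base64("hello")
print(encoded)
print(decode_base64(encoded))
print(hash_field("hello", "sha256"))
```

Passing `validate=True` makes the decoder reject characters outside the base64 alphabet instead of skipping them, which matches the "error on invalid input" behaviour described above.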
extracts the directory name of the field (assuming the field is a file name). Similar to dirname(1).
extracts the base file name of the field (assuming the field is a file name). Similar to basename(1).
extracts the extension of the file name of the field (assuming the field is a file name).
extracts the base file name of the field without the extension (assuming the field is a file name).
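The four file-name operations above split a path into related pieces. A minimal Python sketch using the standard `posixpath` module shows how they relate to each other. The function name and dictionary keys are hypothetical, and edge cases are assumptions of the sketch: for example, `posixpath` keeps the leading dot of the extension and returns an empty directory for a bare file name, which may differ from what datamash or dirname(1) report.

```python
import posixpath


def split_file_name(field):
    """Split a file-name field into directory, base name, extension and bare name."""
    directory = posixpath.dirname(field)
    base = posixpath.basename(field)
    # splitext() splits at the last dot of the base name.
    bare, extension = posixpath.splitext(base)
    return {
        "directory": directory,
        "base": base,
        "extension": extension,
        "bare": bare,
    }


print(split_file_name("/home/user/report.txt"))
```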
extract a number from the field. getnum accepts an optional single-letter option ‘n/i/d/p/h/o’ affecting the detected value.
copy input field to output field (similar to cut(1)).
When the cut operation is given a list of fields, the fields are copied in the given order (in contrast to cut(1)).
an alias for cut.
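The ordering difference from cut(1) noted above can be sketched in a few lines of Python: cut(1) always emits the selected fields in input order, whereas the cut operation copies them in the order they were requested. The function name is hypothetical and the row is assumed to be already split into fields.

```python
def copy_fields(row, field_numbers):
    """Copy fields in the order given; field numbers are 1-based."""
    return [row[number - 1] for number in field_numbers]


row = ["a", "b", "c"]
# Requesting field 3 and then field 1 yields them in that order,
# whereas `cut -f3,1` would still print field 1 before field 3.
print(copy_fields(row, [3, 1]))
```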
sum of the values
minimum value
maximum value
minimum of the absolute values
maximum of the absolute values
range of values (maximum - minimum)
count number of elements in the group
the first value of the group
the last value of the group
one random value from the group
comma-separated sorted list of unique values
an alias for unique.
Use --collapse-delimiter to separate the values with a character other than a comma.
comma-separated list of all input values
Use --collapse-delimiter to separate the values with a character other than a comma.
number of unique/distinct values
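The difference between the unique list, the full list, and the distinct count described above is easiest to see on a small group of values. The following Python sketch is only an illustration: the function names are hypothetical, the `delimiter` parameter stands in for the effect of --collapse-delimiter, and plain string sorting is an assumption about how the unique values are ordered.

```python
def unique_values(values, delimiter=","):
    """Sorted list of distinct values, joined by the delimiter."""
    return delimiter.join(sorted(set(values)))


def collapse_values(values, delimiter=","):
    """All values in input order, duplicates included, joined by the delimiter."""
    return delimiter.join(values)


def count_unique(values):
    """Number of distinct values in the group."""
    return len(set(values))


group = ["b", "a", "b", "c"]
print(unique_values(group))
print(collapse_values(group))
print(unique_values(group, delimiter=";"))
print(count_unique(group))
```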
mean of the values
geometric mean of the values
harmonic mean of the values
trimmed mean of the values
mean square of the values
root mean square of the values
median value
1st quartile value
3rd quartile value
inter-quartile range
percentile value
mode value (most common value)
anti-mode value (least common value)
population standard deviation
sample standard deviation
population variance
sample variance
Median Absolute Deviation, scaled by a constant 1.4826 for normal distributions
Median Absolute Deviation, unscaled
skewness of the (sample) group
skewness of the (population) group
Excess Kurtosis of the (sample) group
Excess Kurtosis of the (population) group
p-value of the Jarque-Bera test for normality
p-value of the D’Agostino-Pearson Omnibus test for normality.
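Several of the statistical operations above differ only in a divisor or a scaling constant. The Python sketch below uses the standard `statistics` module to make those distinctions concrete: population versus sample variance, mean square versus root mean square, and the Median Absolute Deviation with and without the 1.4826 constant. The helper names are hypothetical, and the sketch is not datamash's implementation, so results may differ in rounding or edge cases.

```python
import math
import statistics


def mean_square(values):
    """Mean of the squared values."""
    return sum(v * v for v in values) / len(values)


def root_mean_square(values):
    """Square root of the mean square."""
    return math.sqrt(mean_square(values))


def median_absolute_deviation(values, scale=1.4826):
    """Median of |x - median(x)|, multiplied by `scale`.
    Pass scale=1.0 for the unscaled variant."""
    center = statistics.median(values)
    deviations = [abs(v - center) for v in values]
    return scale * statistics.median(deviations)


data = [1, 2, 4, 8]
print("mean:", statistics.mean(data))
print("geometric mean:", statistics.geometric_mean(data))
print("harmonic mean:", statistics.harmonic_mean(data))
print("mean square:", mean_square(data))
print("root mean square:", root_mean_square(data))
# Population statistics divide by n, sample statistics by n - 1.
print("population variance:", statistics.pvariance(data))
print("sample variance:", statistics.variance(data))
print("population stdev:", statistics.pstdev(data))
print("sample stdev:", statistics.stdev(data))
print("MAD (scaled):", median_absolute_deviation(data))
print("MAD (unscaled):", median_absolute_deviation(data, scale=1.0))
```

The quartile, percentile, skewness, kurtosis, and normality-test operations are left out of the sketch because their results depend on the exact estimator used, which the descriptions above do not specify.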