Difference between revisions of "Awk"

Revision as of 18:14, 28 December 2018

awk -F"," -v VAR=<value> '{<code}': In awk: Use "," as field-separator, set VAR to <value> in.

Arrays can have named keys, and work a bit like a python dict or Perl hashes.

Output each string found in the array <ar> only 1 time.

{
 split($2,ar,"/")
 dict[ar[1]] = 1
} END {
 for (key in dict) 
 {
  OUT=OUT" "key
 }
 print OUT
}

substr(<string>,<start>,<num>): From <sting> return <num> characters, starting from <start>
n=split(var,ARR,<fs>): Split var in array ARR, n holds the number of elements in ARR, <fs> is the field separator, if not given the variable FS is used as field separator (default white space).

Default action of awk for read line is:

NF=split(var,ARR," ")
NR++
$0=var
for ( i=1 ; i<=NF ; i++ ) {
 $i=ARR[i]
}

gsub(<regexp>,<string>,<variable>): Replace <regexp> with <string> in <variable>. Return number of replacements.; <variable> is modified.

printf "%-10s %05d\n", $1, $2: Format output like in Python:Strings#Advanced

Print average of field 4 for all records in <file> containing 'GW'

awk 'BEGIN {}
/GW/ {GW+=$4}
END {print GW/NR}' <file>

var ~ /<pattern>/ {code}: Execute code if var matches <pattern>; !~ to negate the match

gawk '{if (NR%3 == 0) {print p$0;p=""}else{p=p$0}}': Combine lines per 3 (Basic code found on stackoverflow [1])

@@ Line 16: / Line 16: @@
 ;<code>a=length(field)</code>
 :Get the length of a field.
+==Arrays==
+Arrays can have named keys, and work a bit like a [[Python:DataTypes#Dirctionary_or_dict| python dict]] or [[Perl]] hashes.
+Output each string found in the array <ar> only 1 time.
+<syntaxhighlight lang=awk>
+{
+ split($2,ar,"/")
+ dict[ar[1]] = 1
+} END {
+ for (key in dict)
+ {
+  OUT=OUT" "key
+ }
+ print OUT
+}
+</syntaxhighlight>