
需要從具有重複條目的兩個不同文件中進行 vlookup:
文件1
abc 10
xyz 20
bhy 30
hgf 40
文件2
a abc
b xyz
c bhy
d abc
e abc
f xyz
所需輸出:
abc 10 a,d,e
xyz 20 b,f
bhy 30 c
hgf 40 Not_Available
答案1
Awk
解決方案:
awk 'NR == FNR { a[$1] = $2 OFS; next }
$2 in a { a[$2] = a[$2] (a[$2] ~ /\t$/? "" : ",") $1 }
END { for (i in a) print i, a[i] }' OFS='\t' file1 file2
輸出:
bhy 30 c
abc 10 a,d,e
xyz 20 b,f
對於最後一個條件,請使用以下修改:
awk 'NR == FNR { a[$1] = $2 OFS; next }
$2 in a { a[$2] = a[$2] (a[$2] ~ /\t$/? "" : ",") $1 }
END {
for (i in a) print i, a[i] (a[i] ~ /\t$/? "Not_Available" : "")
}' OFS='\t' file1 file2
答案2
用 awk
awk -v OFS='\t' '
NR == FNR {val[$1]=$2; next}
{items[$2] = items[$2] " " $1}
END {
for (a in val) {
sub(/^ /, "", items[a])
gsub(/ /, ",", items[a])
print a, val[a], items[a]
}
}
' file1 file2
bhy 30 c
abc 10 a,d,e
xyz 20 b,f
如果你想對其進行排序,請輸入| sort -k2,2n
只是為了好玩,一點點 Perl 的技巧
perl -lae '
if ($. == ++$nr) { $val{$F[0]} = $F[1] }
else { push @{$items{$F[1]}}, $F[0] }
} continue {
close ARGV if eof
} END {
printf "%s\t%s\t%s\n", $_, $val{$_}, join(",", @{$items{$_}}) for keys %items
' file1 file2