How do I merge two files line by line on matching records in awk/shell?

Answer 1

If you want to use awk:

$ awk 'NR==FNR {a[$1] = $2; next} $1 in a {print $1, $2, a[$1]}' file2.txt file1.txt 
Mary 68 74
Tom 50 26
Jason 45 37

No sorting is needed, and the output follows the order of the second file given on the command line.

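Explanation:

NR==FNR is the idiomatic way to select records from the first file named on the command line
{a[$1] = $2; next} fills an array keyed on the first field, with the second field as the value
$1 in a tests whether the first field was already seen in the first file; if so, then
{print $1, $2, a[$1]} prints the key and value from the second file, followed by the value from the first file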
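
The same one-liner can be spelled out as a commented script, which may make the two phases easier to follow. This is only a sketch: the array name score, the variable names and the temporary directory are illustrative choices, mktemp -d is assumed to be available, and the sample data is copied from the file1.txt and file2.txt shown in this thread.

```shell
#!/bin/sh
# Sketch: the awk merge from above, written out with comments.
# workdir, score and result are illustrative names, not part of the original answer.
workdir=$(mktemp -d)
printf '%s\n' 'Mary 68' 'Tom 50' 'Jason 45' 'Lu 66'   > "$workdir/file1.txt"
printf '%s\n' 'Jason 37' 'Tom 26' 'Mary 74' 'Tina 80' > "$workdir/file2.txt"

result=$(awk '
    # Phase 1: while reading the first file argument (file2.txt),
    # NR and FNR are equal, so store each value under its key and skip ahead.
    NR == FNR { score[$1] = $2; next }
    # Phase 2: for the second file argument (file1.txt),
    # print only the lines whose key was stored in phase 1.
    $1 in score { print $1, $2, score[$1] }
' "$workdir/file2.txt" "$workdir/file1.txt")

printf '%s\n' "$result"
rm -rf "$workdir"
```

Lu and Tina are dropped because each appears in only one of the two files, and the remaining lines come out in file1.txt order.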

Answer 2

This sounds like a job for join, the relational database operator:

join <(sort file1.txt) <(sort file2.txt)

Test:

$ cat file1.txt
Mary 68
Tom 50
Jason 45
Lu 66

$ cat file2.txt
Jason 37
Tom 26
Mary 74
Tina 80

$ join <(sort file1.txt) <(sort file2.txt)
Jason 45 37
Mary 68 74
Tom 50 26

join is a standard tool specified by POSIX.

The join man page states:

The files file1 and file2 shall be ordered in the collating sequence of sort -b on the 
fields on which they shall be joined, by default the first in each line. All selected 
output shall be written in the same collating sequence.
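
Because join depends on the collating sequence, it can help to run sort and join under the same locale and to sort explicitly on the join field. The sketch below does that with temporary files instead of <(...), since process substitution is a bash/ksh/zsh feature rather than part of POSIX sh. The variable names and the .sorted file names are illustrative, mktemp -d is assumed to be available, and LC_ALL=C is one possible choice of a fixed locale.

```shell
#!/bin/sh
# Sketch: sort both inputs on the join field under one fixed locale,
# then join them, without relying on process substitution.
# workdir, joined and the .sorted names are illustrative, not from the original answer.
LC_ALL=C
export LC_ALL

workdir=$(mktemp -d)
printf '%s\n' 'Mary 68' 'Tom 50' 'Jason 45' 'Lu 66'   > "$workdir/file1.txt"
printf '%s\n' 'Jason 37' 'Tom 26' 'Mary 74' 'Tina 80' > "$workdir/file2.txt"

# -k 1,1 limits the sort key to the first field; -b ignores leading blanks,
# matching the "sort -b" ordering that the join specification asks for.
sort -b -k 1,1 "$workdir/file1.txt" > "$workdir/file1.sorted"
sort -b -k 1,1 "$workdir/file2.txt" > "$workdir/file2.sorted"

joined=$(join "$workdir/file1.sorted" "$workdir/file2.sorted")
printf '%s\n' "$joined"
rm -rf "$workdir"
```

Unlike the awk approach, the result comes out in sorted key order (Jason, Mary, Tom) rather than in the order of either input file.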
