Wie lösche ich Wörter aus einer TXT-Datei, die in einer anderen TXT-Datei vorhanden sind?

Question 1

Dafür gibt es einen Befehl: comm. Wie in beschrieben man comm, ist es ganz einfach:

   comm -3 file1 file2
          Print lines in file1 not in file2, and vice versa.

Beachten Sie, dass commder Inhalt der erwarteten Dateien sortiert sein muss. Sie müssen sie also vor dem Aufruf sortieren comm, und zwar so:

sort unsorted-file.txt > sorted-file.txt

Um es zusammenzufassen:

sort a.txt > as.txt

sort b.txt > bs.txt

comm -3 as.txt bs.txt > result.txt

Nach den obigen Befehlen verfügen Sie über die erwarteten Zeilen in der result.txtDatei.

Answer

Dafür gibt es einen Befehl: comm. Wie in beschrieben man comm, ist es ganz einfach:

   comm -3 file1 file2
          Print lines in file1 not in file2, and vice versa.

Beachten Sie, dass commder Inhalt der erwarteten Dateien sortiert sein muss. Sie müssen sie also vor dem Aufruf sortieren comm, und zwar so:

sort unsorted-file.txt > sorted-file.txt

Um es zusammenzufassen:

sort a.txt > as.txt

sort b.txt > bs.txt

comm -3 as.txt bs.txt > result.txt

Nach den obigen Befehlen verfügen Sie über die erwarteten Zeilen in der result.txtDatei.

Question 2

Hier ist ein kurzes Python3-Skript, basierend aufGermars Antwort, wodurch dies unter Beibehaltung b.txtder unsortierten Reihenfolge erreicht werden sollte.

#!/usr/bin/python3

with open('a.txt', 'r') as afile:
    a = set(line.rstrip('\n') for line in afile)

with open('b.txt', 'r') as bfile:
    for line in bfile:
        line = line.rstrip('\n')
        if line not in a:
            print(line)
            # Uncomment the following if you also want to remove duplicates:
            # a.add(line)

Answer

Hier ist ein kurzes Python3-Skript, basierend aufGermars Antwort, wodurch dies unter Beibehaltung b.txtder unsortierten Reihenfolge erreicht werden sollte.

#!/usr/bin/python3

with open('a.txt', 'r') as afile:
    a = set(line.rstrip('\n') for line in afile)

with open('b.txt', 'r') as bfile:
    for line in bfile:
        line = line.rstrip('\n')
        if line not in a:
            print(line)
            # Uncomment the following if you also want to remove duplicates:
            # a.add(line)

Question 3

#!/usr/bin/env python3

with open('a.txt', 'r') as f:
    a_txt = f.read()
a = a_txt.split('\n')
del(a_txt)

with open('b.txt', 'r') as f:
    while True:
        b = f.readline().strip('\n ')
        if not len(b):
            break
        if not b in a:
            print(b)

Answer

#!/usr/bin/env python3

with open('a.txt', 'r') as f:
    a_txt = f.read()
a = a_txt.split('\n')
del(a_txt)

with open('b.txt', 'r') as f:
    while True:
        b = f.readline().strip('\n ')
        if not len(b):
            break
        if not b in a:
            print(b)

Question 4

Schauen Sie sich den commBefehl coreutils an -man comm

NAME
       comm - compare two sorted files line by line

SYNOPSIS
       comm [OPTION]... FILE1 FILE2

DESCRIPTION
       Compare sorted files FILE1 and FILE2 line by line.

       With  no  options,  produce  three-column  output.  Column one contains
       lines unique to FILE1, column two contains lines unique to  FILE2,  and
       column three contains lines common to both files.

       -1     suppress column 1 (lines unique to FILE1)

       -2     suppress column 2 (lines unique to FILE2)

       -3     suppress column 3 (lines that appear in both files)

So können Sie zum Beispiel

$ comm -13 <(sort a.txt) <(sort b.txt)
diary.txt
NOVEMBER.txt

(Zeilen, die eindeutig sind für b.txt)

Answer

Schauen Sie sich den commBefehl coreutils an -man comm

NAME
       comm - compare two sorted files line by line

SYNOPSIS
       comm [OPTION]... FILE1 FILE2

DESCRIPTION
       Compare sorted files FILE1 and FILE2 line by line.

       With  no  options,  produce  three-column  output.  Column one contains
       lines unique to FILE1, column two contains lines unique to  FILE2,  and
       column three contains lines common to both files.

       -1     suppress column 1 (lines unique to FILE1)

       -2     suppress column 2 (lines unique to FILE2)

       -3     suppress column 3 (lines that appear in both files)

So können Sie zum Beispiel

$ comm -13 <(sort a.txt) <(sort b.txt)
diary.txt
NOVEMBER.txt

(Zeilen, die eindeutig sind für b.txt)

Wie lösche ich Wörter aus einer TXT-Datei, die in einer anderen TXT-Datei vorhanden sind?

Antwort1

Antwort2

Antwort3

Antwort4

verwandte Informationen