다른 txt 파일에 존재하는 txt 파일에서 단어를 삭제하는 방법은 무엇입니까?

Question 1

이를 수행하는 명령이 있습니다: comm. 에서 언급한 바와 같이 man comm, 이는 매우 간단합니다:

   comm -3 file1 file2
          Print lines in file1 not in file2, and vice versa.

comm파일 내용이 정렬될 것으로 예상하므로 다음 comm과 같이 호출하기 전에 파일 내용을 정렬해야 합니다.

sort unsorted-file.txt > sorted-file.txt

요약하면 다음과 같습니다.

sort a.txt > as.txt

sort b.txt > bs.txt

comm -3 as.txt bs.txt > result.txt

위 명령 뒤에는 파일에 예상되는 줄이 있습니다 result.txt.

Answer

이를 수행하는 명령이 있습니다: comm. 에서 언급한 바와 같이 man comm, 이는 매우 간단합니다:

   comm -3 file1 file2
          Print lines in file1 not in file2, and vice versa.

comm파일 내용이 정렬될 것으로 예상하므로 다음 comm과 같이 호출하기 전에 파일 내용을 정렬해야 합니다.

sort unsorted-file.txt > sorted-file.txt

요약하면 다음과 같습니다.

sort a.txt > as.txt

sort b.txt > bs.txt

comm -3 as.txt bs.txt > result.txt

위 명령 뒤에는 파일에 예상되는 줄이 있습니다 result.txt.

Question 2

다음은 다음을 기반으로 하는 짧은 python3 스크립트입니다.게르마르의 답변b.txt, 이는 의 정렬되지 않은 순서를 유지하면서 이를 수행해야 합니다 .

#!/usr/bin/python3

with open('a.txt', 'r') as afile:
    a = set(line.rstrip('\n') for line in afile)

with open('b.txt', 'r') as bfile:
    for line in bfile:
        line = line.rstrip('\n')
        if line not in a:
            print(line)
            # Uncomment the following if you also want to remove duplicates:
            # a.add(line)

Answer

다음은 다음을 기반으로 하는 짧은 python3 스크립트입니다.게르마르의 답변b.txt, 이는 의 정렬되지 않은 순서를 유지하면서 이를 수행해야 합니다 .

#!/usr/bin/python3

with open('a.txt', 'r') as afile:
    a = set(line.rstrip('\n') for line in afile)

with open('b.txt', 'r') as bfile:
    for line in bfile:
        line = line.rstrip('\n')
        if line not in a:
            print(line)
            # Uncomment the following if you also want to remove duplicates:
            # a.add(line)

Question 3

#!/usr/bin/env python3

with open('a.txt', 'r') as f:
    a_txt = f.read()
a = a_txt.split('\n')
del(a_txt)

with open('b.txt', 'r') as f:
    while True:
        b = f.readline().strip('\n ')
        if not len(b):
            break
        if not b in a:
            print(b)

Answer

#!/usr/bin/env python3

with open('a.txt', 'r') as f:
    a_txt = f.read()
a = a_txt.split('\n')
del(a_txt)

with open('b.txt', 'r') as f:
    while True:
        b = f.readline().strip('\n ')
        if not len(b):
            break
        if not b in a:
            print(b)

Question 4

coreutils 명령을 살펴보십시오 comm.man comm

NAME
       comm - compare two sorted files line by line

SYNOPSIS
       comm [OPTION]... FILE1 FILE2

DESCRIPTION
       Compare sorted files FILE1 and FILE2 line by line.

       With  no  options,  produce  three-column  output.  Column one contains
       lines unique to FILE1, column two contains lines unique to  FILE2,  and
       column three contains lines common to both files.

       -1     suppress column 1 (lines unique to FILE1)

       -2     suppress column 2 (lines unique to FILE2)

       -3     suppress column 3 (lines that appear in both files)

예를 들어 당신은 할 수 있습니다

$ comm -13 <(sort a.txt) <(sort b.txt)
diary.txt
NOVEMBER.txt

( 에 고유한 줄 b.txt)

Answer

coreutils 명령을 살펴보십시오 comm.man comm

NAME
       comm - compare two sorted files line by line

SYNOPSIS
       comm [OPTION]... FILE1 FILE2

DESCRIPTION
       Compare sorted files FILE1 and FILE2 line by line.

       With  no  options,  produce  three-column  output.  Column one contains
       lines unique to FILE1, column two contains lines unique to  FILE2,  and
       column three contains lines common to both files.

       -1     suppress column 1 (lines unique to FILE1)

       -2     suppress column 2 (lines unique to FILE2)

       -3     suppress column 3 (lines that appear in both files)

예를 들어 당신은 할 수 있습니다

$ comm -13 <(sort a.txt) <(sort b.txt)
diary.txt
NOVEMBER.txt

( 에 고유한 줄 b.txt)

다른 txt 파일에 존재하는 txt 파일에서 단어를 삭제하는 방법은 무엇입니까?

답변1

답변2

답변3

답변4

관련 정보