How to extract only some fields from lines of CSV text

Answer 1

You could try the following:

grep -o "^[0-9]*\|,tran.*$" file | sed 'N;s/\n,/,/'

Output:

391,translate_hits=4399,untranslate_hits=4413
431,translate_hits=284903,untranslate_hits=8472
432,translate_hits=0,untranslate_hits=0
436,translate_hits=1966,untranslate_hits=1966
437,translate_hits=84908,untranslate_hits=1965
440,translate_hits=18970,untranslate_hits=18970
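
To see why the pipeline works, here is a minimal sketch that runs its two stages separately on a single made-up input line. The sample line and its middle fields (`name=foo`, `kind=static`) are assumptions for illustration only, not data from the question. Note also that `\|` alternation inside a basic regular expression is a GNU grep extension, so this may behave differently with other grep implementations.

```shell
# Hypothetical input line: the middle fields are invented for illustration.
line='391,name=foo,kind=static,translate_hits=4399,untranslate_hits=4413'

# Stage 1: grep -o prints each match on its own line:
# first the leading digits, then everything from ",tran" to the end of the line.
stage1=$(printf '%s\n' "$line" | grep -o "^[0-9]*\|,tran.*$")
printf '%s\n' "$stage1"

# Stage 2: sed appends the next line to the pattern space (N), then
# deletes the newline in front of the comma, joining each pair of lines.
joined=$(printf '%s\n' "$stage1" | sed 'N;s/\n,/,/')
printf '%s\n' "$joined"
```

Because the second stage joins lines strictly in pairs, this approach relies on every input line producing exactly two matches: one leading number and one `,tran...` tail.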

Answer 2

Assuming that no field in the file contains an embedded comma or newline (i.e., it is a "simple CSV" file), you can get the first field and the last two fields of each line with:

$ awk -F , 'BEGIN { OFS=FS } { print $1, $(NF-1), $NF }' file.csv
391,translate_hits=4399,untranslate_hits=4413
431,translate_hits=284903,untranslate_hits=8472
432,translate_hits=0,untranslate_hits=0
436,translate_hits=1966,untranslate_hits=1966
437,translate_hits=84908,untranslate_hits=1965
440,translate_hits=18970,untranslate_hits=18970

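NF is a special variable that holds the number of fields on the current line. The -F , option sets the input field separator to a comma, and the BEGIN block copies it to the output field separator (OFS=FS), so both are commas. The main block then uses print to output only the fields of interest.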
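
As a small sketch of how NF adapts to each line, the same awk program can be run on lines with different numbers of fields. The sample data below is hypothetical (the middle fields are invented), and the data is piped in rather than read from file.csv so that the example is self-contained.

```shell
# Hypothetical sample data: the first line has 5 fields, the second has 4.
sample='391,name=foo,kind=static,translate_hits=4399,untranslate_hits=4413
432,name=bar,translate_hits=0,untranslate_hits=0'

# NF is recomputed for every line, so $(NF-1) and $NF always refer to
# the last two fields, regardless of how many fields precede them.
result=$(printf '%s\n' "$sample" |
  awk -F , 'BEGIN { OFS=FS } { print $1, $(NF-1), $NF }')
printf '%s\n' "$result"
```

Counting fields from the end this way avoids hard-coding column positions, which is useful when the number of middle fields varies between lines.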
