計算文字檔案中第二列的每一行的非零數字

Question 1

您可以使用-cgrep 選項。您可以使用以下命令刪除第一個逗號之前的所有字元以及第二個逗號之後的所有內容sed：

sed 's/^[^,]*,//;s/,.*//' < the_file | grep -c -E '[^0]'

編輯：此sed命令的作用與您的命令相同，cut因此您也應該能夠使用原始grep命令。

EDIT2：如果您只想使用一個命令，您可以使用 @cuonglm grp 答案。如果您只想使用一次調用為了總結最後的行數，需要sed對標籤進行大量工作。

sed -E -n '
    s/^[^,]*,[^0,]+,.*/+1/   # replace the lines we are interested in with "+1"
    T delete_line            # if we did not do a substitution right now we jump to "delete_line"
    H                        # we did not jump (so we did the substitution and append the "+1" to the hold space
    : delete_line            # the label, here we do nothing (silently drop the current line)
    $ {                      # on the last line we ...
        s/.*/0/              # replace the whole line with "0"
        G                    # append the hold space (all the "+1" from before")
        s/\n//g              # remove all newlines
        p                    # print the line
    }' < the_file

現在可以將其通過管道傳輸bc，或者您可以p用一些複雜的sed魔法替換該命令，以將這些數字匯總到sed.我相信我聽說這sed已經完成，所以它應該是可能的。

如果你只想使用一個程式( sed) 但不介意多次呼叫它，這樣會容易得多：

sed '/^[^,]*,0,.*/d' < the_file | sed -n '$='

Answer

您可以使用-cgrep 選項。您可以使用以下命令刪除第一個逗號之前的所有字元以及第二個逗號之後的所有內容sed：

sed 's/^[^,]*,//;s/,.*//' < the_file | grep -c -E '[^0]'

編輯：此sed命令的作用與您的命令相同，cut因此您也應該能夠使用原始grep命令。

EDIT2：如果您只想使用一個命令，您可以使用 @cuonglm grp 答案。如果您只想使用一次調用為了總結最後的行數，需要sed對標籤進行大量工作。

sed -E -n '
    s/^[^,]*,[^0,]+,.*/+1/   # replace the lines we are interested in with "+1"
    T delete_line            # if we did not do a substitution right now we jump to "delete_line"
    H                        # we did not jump (so we did the substitution and append the "+1" to the hold space
    : delete_line            # the label, here we do nothing (silently drop the current line)
    $ {                      # on the last line we ...
        s/.*/0/              # replace the whole line with "0"
        G                    # append the hold space (all the "+1" from before")
        s/\n//g              # remove all newlines
        p                    # print the line
    }' < the_file

現在可以將其通過管道傳輸bc，或者您可以p用一些複雜的sed魔法替換該命令，以將這些數字匯總到sed.我相信我聽說這sed已經完成，所以它應該是可能的。

如果你只想使用一個程式( sed) 但不介意多次呼叫它，這樣會容易得多：

sed '/^[^,]*,0,.*/d' < the_file | sed -n '$='

Question 2

和grep：

grep -c '^[^,]*,[^0]' <file

-0僅當第二列的形式類似於整數而不是,時才有效+0。對於更一般的情況，請參閱@Stéphane Chazelas 的回答。

Answer

和grep：

grep -c '^[^,]*,[^0]' <file

-0僅當第二列的形式類似於整數而不是,時才有效+0。對於更一般的情況，請參閱@Stéphane Chazelas 的回答。

Question 3

grep -c '^[^,]*,[-+0-9.]*[1-9]'

這應該涵蓋表示為12, -1, 0e+12, 01, 的數字0.0001。但不是 for0xFF或InforNaN例如，所以這仍然與更規範的不同：

POSIXLY_CORRECT=1 awk -v n=0 -F , '$2 != 0 {n++}; END{print n}'

如果您的輸入有以這種格式表示的數字。

對於sed唯一的解決方案，您可以這樣做：

sed '/^[^,]*,[-+0-9]*[1-9]/!d' | sed -n '$='

但對於只有一次調用的解決方案sed，我們需要手動進行算術。

sed -n '
  1{x;s/$/0,:0123456789,0/;x;}
  /^[^,]*,[-+0-9]*[1-9]/ {
    x;:1
    s/^,/1/;s/\(.\),\(.*:.*\1\(,*.\)\)/\3\2/;t1
    s/:/,:/
    x
  }
  ${x;s/,.*//p;}'

Answer

grep -c '^[^,]*,[-+0-9.]*[1-9]'

這應該涵蓋表示為12, -1, 0e+12, 01, 的數字0.0001。但不是 for0xFF或InforNaN例如，所以這仍然與更規範的不同：

POSIXLY_CORRECT=1 awk -v n=0 -F , '$2 != 0 {n++}; END{print n}'

如果您的輸入有以這種格式表示的數字。

對於sed唯一的解決方案，您可以這樣做：

sed '/^[^,]*,[-+0-9]*[1-9]/!d' | sed -n '$='

但對於只有一次調用的解決方案sed，我們需要手動進行算術。

sed -n '
  1{x;s/$/0,:0123456789,0/;x;}
  /^[^,]*,[-+0-9]*[1-9]/ {
    x;:1
    s/^,/1/;s/\(.\),\(.*:.*\1\(,*.\)\)/\3\2/;t1
    s/:/,:/
    x
  }
  ${x;s/,.*//p;}'

計算文字檔案中第二列的每一行的非零數字

答案1

答案2

答案3

相關內容