すべてのパターンがファイル内にあるか確認する

Question 1

cat file.txt | awk '
    NR == FNR {seen[$0] = 0; next} 
    {for (p in seen) if ($0 ~ p) seen[p]++} 
    END {
        for (p in seen) 
            if (seen[p] == 0) {
                missing++
                print "missing pattern", p
            } 
        if (missing == 0) print "all found"
        exit missing
    }
' patterns.txt -

catコマンドを、テキストを生成するパイプラインに置き換えます。

Answer

cat file.txt | awk '
    NR == FNR {seen[$0] = 0; next} 
    {for (p in seen) if ($0 ~ p) seen[p]++} 
    END {
        for (p in seen) 
            if (seen[p] == 0) {
                missing++
                print "missing pattern", p
            } 
        if (missing == 0) print "all found"
        exit missing
    }
' patterns.txt -

catコマンドを、テキストを生成するパイプラインに置き換えます。

Question 2

これはうまくいくかもしれません:

sort -u patterns.txt > sorted_patterns.txt # only once
diff -sq <(grep -o -f sorted_patterns.txt file.txt | sort -u) sorted_patterns.txt

パターンではなく固定文字列がある場合は、を使用します-F。これにより、grep処理速度が大幅に向上します。

あなたは出来るまた使用するcmpの代わりにを使用しますdiff -s。少し速くなるかもしれませんが、何が欠けているかを表示することはできません。

すべてのパターンが見つからなかった場合は出力します。

Files /dev/fd/63 and /dev/fd/62 differ

またはすべてのパターンが見つかった場合:

Files /dev/fd/63 and /dev/fd/62 are identical

-q知るために残す何不足している。

2a3
> missing_word

Answer