Como posso acrescentar uma contagem incremental a cada palavra predefinida de um arquivo de texto?

Question 1

Eu preferiria perlpara isso:

$ cat ip.txt 
He drove his car to the cinema. He then went inside the cinema to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema.

$ # forward counting is easy
$ perl -pe 's/\bcinema\b/$&.++$i/ge' ip.txt 
He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema3.

\bcinema\bpalavra a ser pesquisada, usando limites de palavras para que não corresponda como parte parcial de outra palavra. Por exemplo, \bpar\bnão corresponderá apartou parkouspar
gea gbandeira é para substituição global. epermite usar código Perl na seção de substituição
$&.++$ié a concatenação da palavra correspondente e do valor pré-incrementado $ique tem o valor padrão de0

Para reverter, precisamos primeiro obter a contagem...

$ c=$(grep -ow 'cinema' ip.txt | wc -l) perl -pe 's/\bcinema\b/$&.$ENV{c}--/ge' ip.txt 
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

ctorna-se variável de ambiente acessível através do hash%ENV

ou, perlsozinho, sorvendo o arquivo inteiro

perl -0777 -pe '$c=()=/\bcinema\b/g; s//$&.$c--/ge' ip.txt

Answer

Eu preferiria perlpara isso:

$ cat ip.txt 
He drove his car to the cinema. He then went inside the cinema to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema.

$ # forward counting is easy
$ perl -pe 's/\bcinema\b/$&.++$i/ge' ip.txt 
He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema3.

\bcinema\bpalavra a ser pesquisada, usando limites de palavras para que não corresponda como parte parcial de outra palavra. Por exemplo, \bpar\bnão corresponderá apartou parkouspar
gea gbandeira é para substituição global. epermite usar código Perl na seção de substituição
$&.++$ié a concatenação da palavra correspondente e do valor pré-incrementado $ique tem o valor padrão de0

Para reverter, precisamos primeiro obter a contagem...

$ c=$(grep -ow 'cinema' ip.txt | wc -l) perl -pe 's/\bcinema\b/$&.$ENV{c}--/ge' ip.txt 
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

ctorna-se variável de ambiente acessível através do hash%ENV

ou, perlsozinho, sorvendo o arquivo inteiro

perl -0777 -pe '$c=()=/\bcinema\b/g; s//$&.$c--/ge' ip.txt

Question 2

Com GNU awk para RS multi-char, correspondência sem distinção entre maiúsculas e minúsculas e limites de palavras:

$ awk -v RS='^$' -v ORS= -v word='cinema' '
    BEGIN { IGNORECASE=1 }
    { cnt=gsub("\\<"word"\\>","&"); while (sub("\\<"word"\\>","&"cnt--)); print }
' file
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

Answer

Com GNU awk para RS multi-char, correspondência sem distinção entre maiúsculas e minúsculas e limites de palavras:

$ awk -v RS='^$' -v ORS= -v word='cinema' '
    BEGIN { IGNORECASE=1 }
    { cnt=gsub("\\<"word"\\>","&"); while (sub("\\<"word"\\>","&"cnt--)); print }
' file
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

Question 3

Levando em consideração a pontuação após a palavra.
Numeração direta:

word="cinema"
awk -v word="$word" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" ++count,$i) 
        }
      print 
    }' input-file

Numeração inversa:

word="cinema"
count="$(awk -v word="$word" '
    { count += gsub(word, "") }
    END { print count }' input-file)"
awk -v word="$word" -v count="$count" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" count--, $i) 
        }
      print 
    }' input-file

Answer

Levando em consideração a pontuação após a palavra.
Numeração direta:

word="cinema"
awk -v word="$word" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" ++count,$i) 
        }
      print 
    }' input-file

Numeração inversa:

word="cinema"
count="$(awk -v word="$word" '
    { count += gsub(word, "") }
    END { print count }' input-file)"
awk -v word="$word" -v count="$count" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" count--, $i) 
        }
      print 
    }' input-file

Question 4

Para marcar a palavra em ordem decrescente invertemos o regex E invertemos os dados e finalmente invertemos a data mais uma vez para efetuar a transformação:

perl -l -0777pe '$_ = reverse reverse =~ s/(?=\bamenic\b)/++$a/gre' input.data

Resultado

He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema1.

Para marcar a palavra em ordem crescente, fazemos uma busca por trás da palavra:

perl -lpe 's/\bcinema\b\K/++$a/eg' input.data

Resultado

He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema3.

Answer

Para marcar a palavra em ordem decrescente invertemos o regex E invertemos os dados e finalmente invertemos a data mais uma vez para efetuar a transformação:

perl -l -0777pe '$_ = reverse reverse =~ s/(?=\bamenic\b)/++$a/gre' input.data

Resultado

He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema1.

Para marcar a palavra em ordem crescente, fazemos uma busca por trás da palavra:

perl -lpe 's/\bcinema\b\K/++$a/eg' input.data

Resultado

He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema3.

Como posso acrescentar uma contagem incremental a cada palavra predefinida de um arquivo de texto?

Responder1

Responder2

Responder3

Responder4

Resultado

Resultado

informação relacionada