Как можно добавить инкрементный счетчик к каждому предопределенному слову текстового файла?

Question 1

Я бы предпочел perlдля этого:

$ cat ip.txt 
He drove his car to the cinema. He then went inside the cinema to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema.

$ # forward counting is easy
$ perl -pe 's/\bcinema\b/$&.++$i/ge' ip.txt 
He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema3.

\bcinema\bслово для поиска, используя границы слов, чтобы оно не совпадало как частичная часть другого слова. Например, \bpar\bне будет соответствовать apartили parkилиspar
geфлаг gпредназначен для глобальной замены. eпозволяет использовать код Perl в разделе замены
$&.++$iпредставляет собой конкатенацию совпавшего слова и предварительно увеличенного значения, $iкоторое имеет значение по умолчанию0

Для обратного нам сначала нужно получить количество...

$ c=$(grep -ow 'cinema' ip.txt | wc -l) perl -pe 's/\bcinema\b/$&.$ENV{c}--/ge' ip.txt 
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

cстановится переменной окружения, доступной через хэш%ENV

или, в perlодиночку, прихлебывая весь файл

perl -0777 -pe '$c=()=/\bcinema\b/g; s//$&.$c--/ge' ip.txt

Answer

Я бы предпочел perlдля этого:

$ cat ip.txt 
He drove his car to the cinema. He then went inside the cinema to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema.

$ # forward counting is easy
$ perl -pe 's/\bcinema\b/$&.++$i/ge' ip.txt 
He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema3.

\bcinema\bслово для поиска, используя границы слов, чтобы оно не совпадало как частичная часть другого слова. Например, \bpar\bне будет соответствовать apartили parkилиspar
geфлаг gпредназначен для глобальной замены. eпозволяет использовать код Perl в разделе замены
$&.++$iпредставляет собой конкатенацию совпавшего слова и предварительно увеличенного значения, $iкоторое имеет значение по умолчанию0

Для обратного нам сначала нужно получить количество...

$ c=$(grep -ow 'cinema' ip.txt | wc -l) perl -pe 's/\bcinema\b/$&.$ENV{c}--/ge' ip.txt 
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

cстановится переменной окружения, доступной через хэш%ENV

или, в perlодиночку, прихлебывая весь файл

perl -0777 -pe '$c=()=/\bcinema\b/g; s//$&.$c--/ge' ip.txt

Question 2

С GNU awk для многосимвольного RS, совпадения без учета регистра и границ слов:

$ awk -v RS='^$' -v ORS= -v word='cinema' '
    BEGIN { IGNORECASE=1 }
    { cnt=gsub("\\<"word"\\>","&"); while (sub("\\<"word"\\>","&"cnt--)); print }
' file
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

Answer

С GNU awk для многосимвольного RS, совпадения без учета регистра и границ слов:

$ awk -v RS='^$' -v ORS= -v word='cinema' '
    BEGIN { IGNORECASE=1 }
    { cnt=gsub("\\<"word"\\>","&"); while (sub("\\<"word"\\>","&"cnt--)); print }
' file
He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and afterwards discovered that it was more then two years since he last visited the cinema1.

Question 3

С учетом знаков препинания после слова.
Прямая нумерация:

word="cinema"
awk -v word="$word" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" ++count,$i) 
        }
      print 
    }' input-file

Обратная нумерация:

word="cinema"
count="$(awk -v word="$word" '
    { count += gsub(word, "") }
    END { print count }' input-file)"
awk -v word="$word" -v count="$count" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" count--, $i) 
        }
      print 
    }' input-file

Answer

С учетом знаков препинания после слова.
Прямая нумерация:

word="cinema"
awk -v word="$word" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" ++count,$i) 
        }
      print 
    }' input-file

Обратная нумерация:

word="cinema"
count="$(awk -v word="$word" '
    { count += gsub(word, "") }
    END { print count }' input-file)"
awk -v word="$word" -v count="$count" '
    { 
      for (i = 1; i <= NF; i++) 
        if ($i ~ word "([,.;:)]|$)") { 
          gsub(word, word "" count--, $i) 
        }
      print 
    }' input-file

Question 4

Для разметки слова в порядке убывания мы инвертируем регулярное выражение И инвертируем данные, а затем инвертируем дату еще раз, чтобы осуществить преобразование:

perl -l -0777pe '$_ = reverse reverse =~ s/(?=\bamenic\b)/++$a/gre' input.data

Результат

He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema1.

Для маркировки слова в порядке возрастания мы выполняем ретроспективный поиск слова:

perl -lpe 's/\bcinema\b\K/++$a/eg' input.data

Результат

He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema3.

Answer

Для разметки слова в порядке убывания мы инвертируем регулярное выражение И инвертируем данные, а затем инвертируем дату еще раз, чтобы осуществить преобразование:

perl -l -0777pe '$_ = reverse reverse =~ s/(?=\bamenic\b)/++$a/gre' input.data

Результат

He drove his car to the cinema3. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema1.

Для маркировки слова в порядке возрастания мы выполняем ретроспективный поиск слова:

perl -lpe 's/\bcinema\b\K/++$a/eg' input.data

Результат

He drove his car to the cinema1. He then went inside the cinema2 to purchase tickets, and
afterwards discovered that it was more then two years since he last visited the cinema3.

Как можно добавить инкрементный счетчик к каждому предопределенному слову текстового файла?

решение1

решение2

решение3

решение4

Результат

Результат

Связанный контент