How do I find the position of a word in a jumbled sentence?

Question 1

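IMO the most interesting thing about TeX is its typesetting, and the worst thing is its programming facilities. So it is best to do this kind of programming outside TeX (as far away as possible!) and use TeX only for the typesetting. Everything may be possible in TeX, but that does not make it the easiest or most maintainable solution.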
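As a rough illustration of the "outside TeX" approach, here is a minimal Python sketch that computes the positions and writes out plain TeX for typesetting. Python is just one possible choice, and the function names `strip_tag` and `jumble_to_tex` are made up for this example. It assumes the generated text is pasted or `\input` into a document that defines `\printword` and `\linesep`, and that duplicated words are disambiguated with tags such as `the(1)`.

```python
import re


def strip_tag(word):
    """Remove a disambiguation tag such as '(7)' from a word (hypothetical helper)."""
    return re.sub(r"\(.*\)", "", word)


def jumble_to_tex(sentence1, sentence2):
    """Return TeX source that annotates each word of sentence1 with its
    1-based position in sentence2.

    sentence1 uses '+' to separate output lines. Every (tagged) word must be
    unique within sentence2. The output assumes that \\printword and \\linesep
    are defined on the TeX side.
    """
    # Map each tagged word of the jumbled sentence to its position.
    position_for_word = {}
    for i, word in enumerate(sentence2.split(), start=1):
        if word in position_for_word:
            raise ValueError(f"Duplicate word: {word}")
        position_for_word[word] = i

    out = []
    for line in sentence1.split("+"):
        for word in line.split():
            if word not in position_for_word:
                raise ValueError(f"New word: {word}")
            out.append(r"\printword{%s}{%d}"
                       % (strip_tag(word), position_for_word[word]))
        out.append(r"\linesep")

    # Blank line ends the paragraph, then the jumbled sentence without tags.
    jumbled = " ".join(strip_tag(w) for w in sentence2.split())
    out.append("")
    out.append(r"\noindent Jumbled sentence: \textbf{%s}" % jumbled)
    return "\n".join(out)


if __name__ == "__main__":
    print(jumble_to_tex(
        "the(1) quick brown fox + jumps over the(7) lazy dog",
        "the(7) lazy dog jumps over the(1) quick brown fox",
    ))
```

The script does all the bookkeeping, so the TeX side only needs the two formatting macros.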

If you do use TeX anyway, this kind of programming is easier with LuaTeX (at least for me, and I imagine for most people). Compile the following file with lualatex. (I have made the "tags" optional: you can tag every word, as in the(1) quick(2) ..., or tag only the duplicated words.)

\documentclass[12pt]{memoir}
\usepackage{amsmath} % For \text

\newcommand{\printword}[2]{$\text{#1} ^ {#2}$\quad} % Or whatever formatting you like.
\newcommand{\linesep}{\newline}

\directlua{dofile('jumble.lua')}
\newcommand{\printjumble}[2]{
  \directlua{get_sentence1_lines()}{#1}
  \directlua{get_sentence2_words()}{#2}
  %
  \noindent
  Actual sentence:
  \newline
  \directlua{print_sentence1_lines()}

  \noindent
  Jumbled sentence:
  \textbf{\directlua{print_sentence2()}}
}

\begin{document}
\printjumble{
  the(1) quick brown fox
  +
  jumps over the(7) lazy dog
}{
  the(7) lazy dog jumps over the(1) quick brown fox
}
\end{document}

Here jumble.lua (which could be inlined in the same .tex file, though I prefer to keep it separate) is the following:

-- Expected from TeX: before calling print_sentence1_lines(),
--     call get_sentence1_lines() and get_sentence2_words()
--     define \printword and \linesep.
-- Globals: sentence2_words, position_for_word, sentence1_lines

function get_sentence1_lines()
   sentence1_lines = token.scan_string()
end

function get_sentence2_words()
   local sentence2 = token.scan_string()
   sentence2_words = {}
   position_for_word = {}
   local i = 0
   for word in string.gmatch(sentence2, "%S+") do
      i = i + 1
      assert(position_for_word[word] == nil, string.format('Duplicate word: %s', word))
      sentence2_words[i] = without_tags(word)
      position_for_word[word] = i
   end
end

function print_sentence2()
   for i, word in ipairs(sentence2_words) do
      tex.print(word)
   end
end

function print_sentence1_lines()
   for line in string.gmatch(sentence1_lines, "[^+]+") do
      for word in string.gmatch(line, "%S+") do
         local position = position_for_word[word]
         assert(position ~= nil, string.format('New word: %s', word))
         tex.print(string.format([[\printword{%s}{%s}]], without_tags(word), position))
      end
      tex.print([[\linesep]])
   end
end

function without_tags(word)
   local new_word = string.gsub(word, "%(.*%)", "")
   return new_word
end

This produces the same output as in the question.

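You could make this a bit shorter by moving things around (see, for example, the first revision of this answer), but I think it is cleanest to keep (as much as possible) the typesetting instructions in the .tex file and the programming in the .lua file.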

Answer