意味がごちゃごちゃした文章の中で単語の位置を見つけるにはどうすればいいでしょうか?

Question 1

私の意見では、TeXの最も興味深い点はタイプセッティングであり、最も悪い点はプログラミング機能です。そのため、プログラミングはTeXの外で（できるだけ遠くで！）行い、TeXはタイプセッティングのみに使用するのが最善です。可能TeX では可能ですが、必ずしも最も簡単で保守しやすいソリューションとは限りません。

それでも、TeX を使用する場合、この種のプログラミングは LuaTeX で行う方が簡単です (少なくとも私にとっては、そしてほとんどの人にとってはそうだと思います)。次のファイルをコンパイルしますlualatex(「タグ」はオプションにしました。のようにすべての単語にタグを付けたりthe(1) quick(2) ...、重複する単語だけにタグを付けたりできます)。

\documentclass[12pt]{memoir}
\usepackage{amsmath} % For \text

\newcommand{\printword}[2]{$\text{#1} ^ {#2}$\quad} % Or whatever formatting you like.
\newcommand{\linesep}{\newline}

\directlua{dofile('jumble.lua')}
\newcommand{\printjumble}[2]{
  \directlua{get_sentence1_lines()}{#1}
  \directlua{get_sentence2_words()}{#2}
  %
  \noindent
  Actual sentence:
  \newline
  \directlua{print_sentence1_lines()}

  \noindent
  Jumbled sentence:
  \textbf{\directlua{print_sentence2()}}
}

\begin{document}
\printjumble{
  the(1) quick brown fox
  +
  jumps over the(7) lazy dog
}{
  the(7) lazy dog jumps over the(1) quick brown fox
}
\end{document}

ここで、jumble.lua(同じ.texファイルにインライン化することもできますが、別々にしておくことを好みます) は次のとおりです。

-- Expected from TeX: before calling print_sentence1_lines(),
--     call get_sentence1_lines() and get_sentence2_words()
--     define \printword and \linesep.
-- Globals: sentence2_words, position_for_word, sentence1_lines

function get_sentence1_lines()
   sentence1_lines = token.scan_string()
end

function get_sentence2_words()
   local sentence2 = token.scan_string()
   sentence2_words = {}
   position_for_word = {}
   local i = 0
   for word in string.gmatch(sentence2, "%S+") do
      i = i + 1
      assert(position_for_word[word] == nil, string.format('Duplicate word: %s', word))
      sentence2_words[i] = without_tags(word)
      position_for_word[word] = i
   end
end

function print_sentence2()
   for i, word in ipairs(sentence2_words) do
      tex.print(word)
   end
end

function print_sentence1_lines()
   for line in string.gmatch(sentence1_lines, "[^+]+") do
      for word in string.gmatch(line, "%S+") do
         position = position_for_word[word]
         assert(position_for_word[word] ~= nil, string.format('New word: %s', word))
         tex.print(string.format([[\printword{%s}{%s}]], without_tags(word), position))
      end
      tex.print([[\linesep]])
   end
end

function without_tags(word)
   local new_word = string.gsub(word, "%(.*%)", "")
   return new_word
end

これにより

質問にあるように。

.tex注意: 内容を移動することでこれをもう少し短くすることができます (たとえば、この回答の最初の改訂版を参照) が、タイプセットの指示とプログラミングを(可能な限り).luaファイル内に保持するのが最もきれいだと思います。

Answer