![\MakeShortVerb{\§} 和 \usepackage[utf8]{inputenc}](https://rvso.com/image/330816/%5CMakeShortVerb%7B%5C%C2%A7%7D%20%E5%92%8C%20%5Cusepackage%5Butf8%5D%7Binputenc%7D.png)
我有一本十多年前出版的書。它的序言中包含以下程式碼:
\documentclass{article}
\usepackage[cp1251]{inputenc}
\usepackage[T1,T2A]{fontenc}
\usepackage[english,russian]{babel}
\usepackage{shortvrb}
\MakeShortVerb{\§}
\begin{document}
§\begin{i}§
\end{document}
現在我想將原始檔轉換為unicode。但是將cp1251
編碼更改為之後utf8
\usepackage[utf8]{inputenc}
編譯停止並顯示錯誤訊息,指示有以下問題\MakeShortVerb{\§}
:
! Missing \endcsname inserted.
<to be read again>
\protect
l.8 \MakeShortVerb{\В§}
? h
The control sequence marked <to be read again> should
not appear between \csname and \endcsname.
? h
Sorry, I already gave what help I could...
Maybe you should try asking a human?
An error might have occurred before I noticed any problems.
``If all else fails, read the instructions.''
?
! Package inputenc Error: Keyboard character used is undefined
(inputenc) in inputencoding `utf8'.
See the inputenc package documentation for explanation.
Type H <return> for immediate help.
...
l.8 \MakeShortVerb{\В§}
? r
如何繞過這個問題?沒什麼好說的,我仍然想用作§
短逐字文字的分隔符號。shortverb
包與utf8
編碼相容嗎?
答案1
@egreg 又太快了。但我已經準備好午餐了...
\documentclass{article}
\usepackage[T1,T2A]{fontenc}
\usepackage[english,russian]{babel}
\usepackage[utf8]{inputenc}
%\usepackage{shortvrb}
\makeatletter
\DeclareUnicodeCharacter{00A7}{\IgorSVerb}
\def\IgorSVerb{\begingroup\def\IgorSVerb{\verb@egroup\endgroup}\verb^^a7}
\makeatother
\begin{document}
Hello
§\begin{i}$&^\}{"'çÂ\]%§
§\begin{i}$&^\}{"'çÂ\]%§
\selectlanguage{english}
§\begin{i}$&^\}{"'çÂ\]%§
§\begin{i}$&^\}{"'çÂ\]%§<
\end{document}
答案2
問題是,在 UTF-8 中,§
是兩個位元組長,但\MakeShortVerb
只需要一個。
我能提供的最好的資訊如下:
\documentclass{article}
\usepackage[utf8]{inputenc}
\usepackage[T1,T2A]{fontenc}
\usepackage[english,russian]{babel}
\begingroup\uccode`~="C2 \uppercase{\endgroup
\DeclareUnicodeCharacter{00A7}{\verb~}}
\begingroup\uccode`~="A7 \uppercase{\endgroup\def~}{}
\begin{document}
§\begin{i}§
§{-{\§
\end{document}
限制是 UTF-8 中以 為前綴的字元不能<C2>
出現在逐字文字中:禁止字元清單為
¡¢£¤¥¦§¨©ª«¬®¯°±²³´µ¶·¸¹º»¼½¾¿
即 Unicode 範圍00A1
– 00BF
.