bash 替換變數中的特殊字符

Question

您可能有一個“bom”（位元組順序標記，在基於 unicode 語言環境的系統上使用，用於指定係統的“little-endian”/“big-endian”性質

看https://en.wikipedia.org/wiki/Byte_order_mark

值得慶幸的是，這個似乎適用於 utf-8 語言環境，如果您只期望 ASCII 1-177 個字符，這是一件好事...

您可以透過插入一個被迫（暫時）使用 C 語言環境的 sed 來將其刪除，以便「看到」以下內容：

LC_ALL=C sed '1s/^\xEF\xBB\xBF//'

例如用作：

incoming program | LC_ALL=C sed '1s/^\xEF\xBB\xBF//' | somecmd
 # or
< incomingfile LC_ALL=C sed '1s/^\xEF\xBB\xBF//' > outputfile
  #  <incomingfile  : will give "incomingfile" content as stdin to sed 
  # then sed modifies only the first line, replacing the BOM with ""
  #    (the rest is not touched by sed and is transmitted as-is)
  #  > outputfile : directs sed output (ie, incomingfile without the BOM) to "outputfile"

Answer 1

您可能有一個“bom”（位元組順序標記，在基於 unicode 語言環境的系統上使用，用於指定係統的“little-endian”/“big-endian”性質

看https://en.wikipedia.org/wiki/Byte_order_mark

值得慶幸的是，這個似乎適用於 utf-8 語言環境，如果您只期望 ASCII 1-177 個字符，這是一件好事...

您可以透過插入一個被迫（暫時）使用 C 語言環境的 sed 來將其刪除，以便「看到」以下內容：

LC_ALL=C sed '1s/^\xEF\xBB\xBF//'

例如用作：

incoming program | LC_ALL=C sed '1s/^\xEF\xBB\xBF//' | somecmd
 # or
< incomingfile LC_ALL=C sed '1s/^\xEF\xBB\xBF//' > outputfile
  #  <incomingfile  : will give "incomingfile" content as stdin to sed 
  # then sed modifies only the first line, replacing the BOM with ""
  #    (the rest is not touched by sed and is transmitted as-is)
  #  > outputfile : directs sed output (ie, incomingfile without the BOM) to "outputfile"

bash 替換變數中的特殊字符

答案1

相關內容