bash: replacing special characters in a variable

Question

You probably have a "BOM" (byte order mark, used on systems with Unicode-based locales to indicate whether the encoding is little-endian or big-endian).

See https://en.wikipedia.org/wiki/Byte_order_mark

Fortunately, this one appears to be the UTF-8 one, which is fine if you only expect ASCII characters 1-177 (octal, i.e. 1-127 decimal)...
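To confirm that the stray characters really are a UTF-8 BOM, you can look at the first three bytes of the input. The sketch below is only an illustration: has_bom is a hypothetical helper name, and it assumes head supports the -c option.

```shell
# has_bom: hypothetical helper; succeeds if stdin starts with the bytes EF BB BF.
has_bom() {
  # head -c 3 keeps the first three bytes, od prints them in hex,
  # tr removes the spaces and newline that od adds.
  first=$(head -c 3 | od -An -tx1 | tr -d ' \n')
  [ "$first" = "efbbbf" ]
}

# Example: feed it a line that starts with the BOM (written as octal escapes).
printf '\357\273\277hello\n' | has_bom && echo "BOM found"
```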

You could strip it by inserting a sed into the pipeline, (temporarily) forced to use the C locale so that it "sees" the individual bytes (the \x escapes used here are a GNU sed extension):

LC_ALL=C sed '1s/^\xEF\xBB\xBF//'

used, for example, like this:

incoming program | LC_ALL=C sed '1s/^\xEF\xBB\xBF//' | somecmd
 # or
< incomingfile LC_ALL=C sed '1s/^\xEF\xBB\xBF//' > outputfile
  #  <incomingfile  : will give "incomingfile" content as stdin to sed 
  # then sed modifies only the first line, replacing the BOM with ""
  #    (the rest is not touched by sed and is transmitted as-is)
  #  > outputfile : directs sed output (ie, incomingfile without the BOM) to "outputfile"
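If the value is already in a shell variable, the same prefix can probably be removed without sed, using parameter expansion. This is a minimal sketch rather than part of the original answer: strip_bom, value and clean are hypothetical names, and the BOM is written with octal escapes so the snippet does not depend on bash-only syntax.

```shell
# strip_bom: hypothetical helper; prints its first argument without a leading UTF-8 BOM.
strip_bom() {
  # The three bytes of the UTF-8 BOM (EF BB BF), written as octal escapes.
  bom=$(printf '\357\273\277')
  # ${1#pattern} removes the pattern only if it is a prefix of $1;
  # quoting "$bom" makes the shell treat it as a literal string.
  printf '%s' "${1#"$bom"}"
}

value=$(printf '\357\273\277hello')   # sample value that starts with a BOM
clean=$(strip_bom "$value")
printf '%s\n' "$clean"
```

A value that does not start with the BOM is passed through unchanged, so the helper is safe to apply unconditionally.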

Answer 1