grep または sed を使用して HTML からリンクをフィルターする方法は?

Question

次のコマンドを試してください:

curl -s http://www.example.com | grep -Po '(?<=src=")[^"]*(jpg|png)'

説明:

からman grep：

   -o, --only-matching
          Print only the matched (non-empty) parts of a matching line,
          with each such part on a separate output line.
   -P, --perl-regexp
          Interpret PATTERN as a Perl compatible regular expression (PCRE)

後読みは(?<=src=)、文字列の現在の位置で、先行するのは文字であると主張します。次に、 jpg または png で終わるものsrc=を除くすべてを検索します。"

Answer 1

次のコマンドを試してください:

curl -s http://www.example.com | grep -Po '(?<=src=")[^"]*(jpg|png)'

説明:

からman grep：

   -o, --only-matching
          Print only the matched (non-empty) parts of a matching line,
          with each such part on a separate output line.
   -P, --perl-regexp
          Interpret PATTERN as a Perl compatible regular expression (PCRE)

後読みは(?<=src=)、文字列の現在の位置で、先行するのは文字であると主張します。次に、 jpg または png で終わるものsrc=を除くすべてを検索します。"

grep または sed を使用して HTML からリンクをフィルターする方法は?

答え1

関連情報