Extracting tagged strings from a text file using Bash

Answer 1

My first thought was awk:

awk -vRS='#[^#]+#' 'RT{gsub(/#/,"",RT);p[RT]=1}END{for(i in p)print i}' the_file

The right choice, though, depends on what else you need to do with the data.

As requested in a comment, here is the same command with explanations:

awk -vRS='#[^#]+#' '   # use /#[^#]+#/ as record separator
RT {   # record terminator not empty?
  gsub(/#/,"",RT)    # remove the # parameter delimiter markup
  p[RT]=1   # store it as key in array p
}
END {   # end of input?
  for (i in p) print i   # loop through array p and print each key
}' the_file

The key part is the use of gawk's built-in RT (record terminator) variable:

   RT          The record terminator.  Gawk sets RT to the input text that
               matched the character or regular expression specified by
               RS.
