フィールドと部分文字列を抽出し、ソートされた行をマージします。

Question

awkGNUの使用datamash:

awk 'BEGIN{ OFS=FS="\t" }
  NR>2{                       # skip first two records
    split($3, a, "/" )        # split $3 into array a on /
    domain=a[3]               # 3rd element is the domain name
    sub(/^www\./, "", domain) # remove www. prefix
    print domain, $4          # print domain and email
  }
' file | datamash -g 1 unique 2

このawkセクションでは、最初の2行をスキップし、すべての履歴のドメインと電子メールを印刷します。これは〜になります

a.com   [email protected]
a.com   [email protected]
b.fr    [email protected]
b.fr    [email protected]

その後、出力はdatamash最初のフィールドにパイプされ、入力をグループ化し、2番目のフィールドの固有値のカンマ区切りリストを印刷します。

出力：

a.com   [email protected]
b.fr    [email protected],[email protected]

タイトル行は練習用に予約されています。

Answer 1

awkGNUの使用datamash:

awk 'BEGIN{ OFS=FS="\t" }
  NR>2{                       # skip first two records
    split($3, a, "/" )        # split $3 into array a on /
    domain=a[3]               # 3rd element is the domain name
    sub(/^www\./, "", domain) # remove www. prefix
    print domain, $4          # print domain and email
  }
' file | datamash -g 1 unique 2

このawkセクションでは、最初の2行をスキップし、すべての履歴のドメインと電子メールを印刷します。これは〜になります

a.com   [email protected]
a.com   [email protected]
b.fr    [email protected]
b.fr    [email protected]

その後、出力はdatamash最初のフィールドにパイプされ、入力をグループ化し、2番目のフィールドの固有値のカンマ区切りリストを印刷します。

出力：

a.com   [email protected]
b.fr    [email protected],[email protected]

タイトル行は練習用に予約されています。

フィールドと部分文字列を抽出し、ソートされた行をマージします。

ベストアンサー1

おすすめ記事