テキスト処理速度の向上

Question

まず、名前付きファイルから36行のヘッダーを抽出し、ファイルの残りinputの部分で60000行をランダムに選択し、同じ行を複数回ランダムに選択できます。すべての出力はoutput。

shufGNU coreutilsの使用:

#!/bin/sh

# Fetch header (36 first lines)
head -n 36 <input >output

# Scramble the other lines and pick 60000 (allowing for repeated lines)
tail -n +37 <input | shuf -r -n 60000 >>output

または：

( head -n 36 <input; tail -n +37 <input | shuf -r -n 60000 ) >output

GNU を使用すると、head出力の最後の行の直後に入力ファイルストリームが保持されます。言い換えれば、読み取りが終了した場所から続行shufできます（この機能は機能しない可能性があります）。head一部非GNUhead実装）：

( head -n 36; shuf -r -n 60000 ) <input >output

Answer 1

まず、名前付きファイルから36行のヘッダーを抽出し、ファイルの残りinputの部分で60000行をランダムに選択し、同じ行を複数回ランダムに選択できます。すべての出力はoutput。

shufGNU coreutilsの使用:

#!/bin/sh

# Fetch header (36 first lines)
head -n 36 <input >output

# Scramble the other lines and pick 60000 (allowing for repeated lines)
tail -n +37 <input | shuf -r -n 60000 >>output

または：

( head -n 36 <input; tail -n +37 <input | shuf -r -n 60000 ) >output

GNU を使用すると、head出力の最後の行の直後に入力ファイルストリームが保持されます。言い換えれば、読み取りが終了した場所から続行shufできます（この機能は機能しない可能性があります）。head一部非GNUhead実装）：

( head -n 36; shuf -r -n 60000 ) <input >output

テキスト処理速度の向上

ベストアンサー1

おすすめ記事