Keras : RNN の入力データはどのように準備すればよいですか? 質問する

Question

最新の 5 つの入力を使用して出力を予測するだけの場合は、トレーニングサンプルの 600 タイムステップ全体を提供する必要はありません。トレーニングデータを次の方法で渡すことをお勧めします。

             t=0  t=1  t=2  t=3  t=4  t=5  ...  t=598  t=599
sample0      |---------------------|
sample0           |---------------------|
sample0                |-----------------
...
sample0                                         ----|
sample0                                         ----------|
sample1      |---------------------|
sample1           |---------------------|
sample1                |-----------------
....
....
sample6751                                      ----|
sample6751                                      ----------|

トレーニングシーケンスの合計数は

(600 - 4) * 6752 = 4024192    # (nb_timesteps - discarded_tailing_timesteps) * nb_samples

各トレーニングシーケンスは 5 つのタイムステップで構成されます。各シーケンスの各タイムステップで、特徴ベクトルの 13 要素すべてを渡します。その結果、トレーニングデータの形状は (4024192, 5, 13) になります。

このループにより、データの形状を変更できます。

input = np.random.rand(6752,600,13)
nb_timesteps = 5

flag = 0

for sample in range(input.shape[0]):
    tmp = np.array([input[sample,i:i+nb_timesteps,:] for i in range(input.shape[1] - nb_timesteps + 1)])

    if flag==0:
        new_input = tmp
        flag = 1

    else:
        new_input = np.concatenate((new_input,tmp))

Answer 1

最新の 5 つの入力を使用して出力を予測するだけの場合は、トレーニングサンプルの 600 タイムステップ全体を提供する必要はありません。トレーニングデータを次の方法で渡すことをお勧めします。

             t=0  t=1  t=2  t=3  t=4  t=5  ...  t=598  t=599
sample0      |---------------------|
sample0           |---------------------|
sample0                |-----------------
...
sample0                                         ----|
sample0                                         ----------|
sample1      |---------------------|
sample1           |---------------------|
sample1                |-----------------
....
....
sample6751                                      ----|
sample6751                                      ----------|

トレーニングシーケンスの合計数は

(600 - 4) * 6752 = 4024192    # (nb_timesteps - discarded_tailing_timesteps) * nb_samples

各トレーニングシーケンスは 5 つのタイムステップで構成されます。各シーケンスの各タイムステップで、特徴ベクトルの 13 要素すべてを渡します。その結果、トレーニングデータの形状は (4024192, 5, 13) になります。

このループにより、データの形状を変更できます。

input = np.random.rand(6752,600,13)
nb_timesteps = 5

flag = 0

for sample in range(input.shape[0]):
    tmp = np.array([input[sample,i:i+nb_timesteps,:] for i in range(input.shape[1] - nb_timesteps + 1)])

    if flag==0:
        new_input = tmp
        flag = 1

    else:
        new_input = np.concatenate((new_input,tmp))

Keras : RNN の入力データはどのように準備すればよいですか? 質問する

ベストアンサー1

おすすめ記事