イテレータを使用する最も高速な（最もPython的な）方法質問する

Question

副作用のためだけにマップオブジェクトを作成するべきではありませんが、実際にはイテレータを使用する標準的なレシピがあります。itertoolsドキュメント:

def consume(iterator, n=None):
    "Advance the iterator n-steps ahead. If n is None, consume entirely."
    # Use functions that consume iterators at C speed.
    if n is None:
        # feed the entire iterator into a zero-length deque
        collections.deque(iterator, maxlen=0)
    else:
        # advance to the empty slice starting at position n
        next(islice(iterator, n, n), None)

「完全に消費する」ケースだけの場合、これは次のように簡略化できます。

def consume(iterator):
    collections.deque(iterator, maxlen=0)

この方法を使用するとcollections.deque、すべての要素を保存する必要がなくなり（maxlen=0）、バイトコード解釈のオーバーヘッドなしでCの速度で反復処理が行われます。専用高速パスmaxlen=0deque を使用してイテレータを消費するための deque 実装。

タイミング：

In [1]: import collections

In [2]: x = range(1000)

In [3]: %%timeit
   ...: i = iter(x)
   ...: for _ in i:
   ...:     pass
   ...: 
16.5 µs ± 829 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

In [4]: %%timeit
   ...: i = iter(x)
   ...: collections.deque(i, maxlen=0)
   ...: 
12 µs ± 566 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

もちろん、これはすべてCPythonに基づいています。インタープリタのオーバーヘッドの性質は他のPython実装とはまったく異なり、maxlen=0高速パスはCPythonに特有のものです。abarnertの回答他の Python 実装の場合。

Answer 1

副作用のためだけにマップオブジェクトを作成するべきではありませんが、実際にはイテレータを使用する標準的なレシピがあります。itertoolsドキュメント:

def consume(iterator, n=None):
    "Advance the iterator n-steps ahead. If n is None, consume entirely."
    # Use functions that consume iterators at C speed.
    if n is None:
        # feed the entire iterator into a zero-length deque
        collections.deque(iterator, maxlen=0)
    else:
        # advance to the empty slice starting at position n
        next(islice(iterator, n, n), None)

「完全に消費する」ケースだけの場合、これは次のように簡略化できます。

def consume(iterator):
    collections.deque(iterator, maxlen=0)

この方法を使用するとcollections.deque、すべての要素を保存する必要がなくなり（maxlen=0）、バイトコード解釈のオーバーヘッドなしでCの速度で反復処理が行われます。専用高速パスmaxlen=0deque を使用してイテレータを消費するための deque 実装。

タイミング：

In [1]: import collections

In [2]: x = range(1000)

In [3]: %%timeit
   ...: i = iter(x)
   ...: for _ in i:
   ...:     pass
   ...: 
16.5 µs ± 829 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

In [4]: %%timeit
   ...: i = iter(x)
   ...: collections.deque(i, maxlen=0)
   ...: 
12 µs ± 566 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

もちろん、これはすべてCPythonに基づいています。インタープリタのオーバーヘッドの性質は他のPython実装とはまったく異なり、maxlen=0高速パスはCPythonに特有のものです。abarnertの回答他の Python 実装の場合。

イテレータを使用する最も高速な（最もPython的な）方法質問する

基準

ベストアンサー1

おすすめ記事