
Note the pivotal AI moment in 2012, when a neural network (AlexNet) trained on GPUs crushed the ImageNet competition. Deep learning and non-CPU architectures blossomed.
From the AI Index 2019: “Prior to 2012, AI results closely tracked Moore’s Law, with compute doubling every two years. Post-2012, compute has been doubling every 3.4 months (for a net increase of ~10,000,000x since then).
The y-axis of the chart shows the total amount of compute, in petaflop/s-days, used to train selected results. A petaflop/s-day (pfs-day) consists of performing 10^15 neural net operations per second for one day, or a total of about 10^20 operations.
In the past year and a half, the time required to train a network on cloud infrastructure has fallen from about three hours in October 2017 to about 88 seconds in July 2019. During the same period, the cost to train such a system has fallen similarly.”
— p. 65 of the AI Index 2019 report from Stanford HAI (the recent red datapoints come from OpenAI's compute graph)
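
As a quick sanity check of the arithmetic in the quote, here is a minimal Python sketch. The petaflop/s-day conversion follows directly from the definition above; the 78-month elapsed-time figure used in the doubling-time extrapolation is an illustrative assumption, not a number taken from the report.

```python
# Sanity-check the arithmetic from the AI Index quote above.

# 1) Petaflop/s-day: 10^15 neural net operations per second,
#    sustained for one day.
ops_per_second = 1e15
seconds_per_day = 24 * 60 * 60            # 86,400 seconds
pfs_day_ops = ops_per_second * seconds_per_day
print(f"1 pfs-day ≈ {pfs_day_ops:.2e} operations")   # ≈ 8.64e+19, i.e. ~10^20

# 2) Net growth implied by a 3.4-month doubling time.
DOUBLING_MONTHS = 3.4

def growth_factor(elapsed_months: float) -> float:
    """Multiplier on total training compute after elapsed_months."""
    return 2 ** (elapsed_months / DOUBLING_MONTHS)

# Illustrative assumption: ~6.5 years (78 months) of post-2012 growth.
print(f"After 78 months: ~{growth_factor(78):.1e}x")  # ≈ 8e+06, ~10,000,000x
```

Running this confirms that both headline numbers are internally consistent: one pfs-day really is on the order of 10^20 operations, and a 3.4-month doubling time compounds to roughly a ten-million-fold increase over several years.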
