CommunityNews

Benchmarking Neural Network Training Algorithms

Benchmarking Neural Network Training Algorithms.
Training algorithms, broadly construed, are an essential part of every deep
learning pipeline. Training algorithm improvements that speed up training
across a wide variety of workloads (e.g., better update rules, tuning
protocols, learning rate schedules, or data selection schemes) could save time,
save computational resources, and lead to better, more accurate, models.
Unfortunately, as a community, we are currently unable to reliably identify
training algorithm improvements, or even determine the state-of-the-art
training algorithm. In this work, using concrete experiments, we argue that
real progress in speeding up training requires new benchmarks that resolve
three basic challenges faced by empirical comparisons of training algorithms:
(1) how to decide when training is complete and precisely measure training
time, (2) how to handle the sensitivity of measurements to exact workload
details, and (3) how to fairly compare algorithms that require hyperparameter
tuning. In order to address these challenges, we introduce a new, competitive,
time-to-result benchmark using multiple workloads running on fixed hardware,
the AlgoPerf: Training Algorithms benchmark. Our benchmark includes a set of
workload variants that make it possible to detect benchmark submissions that
are more robust to workload changes than current widely-used methods. Finally,
we evaluate baseline submissions constructed using various optimizers that
represent current practice, as well as other optimizers that have recently
received attention in the literature. These baseline results collectively
demonstrate the feasibility of our benchmark, show that non-trivial gaps
between methods exist, and set a provisional state-of-the-art for future
benchmark submissions to try and surpass.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.

View thread on forum

#algorithms #training

0 453 0

2023-06-16 00:30:05 UTC

Where Next?

View thread on forum

algorithms

training

Home General Dev>In The News

#algorithms #training

0 453 0

Last post

Popular General Dev topics

General Dev>In The News

SkiftOS: Simple, handmade operating system for the x86 platform

skiftOS is a simple, handmade operating system for the x86 platform, aiming for clean and pretty APIs while keeping the spirit of UNIX. s...

github.com

#skiftos

2 1426 3

2021-01-28 14:47:06 UTC

New

General Dev>In The News

DOD: Guidance on Software Development and Open Source Software (pdf)

MEMORANDUM FOR SENIOR PENTAGON LEADERSHIP COMMANDANT OF THE COAST GUARD COMMANDERS OF THE COMBATANT COMMANDS DEFENSE AGENCY AND DOD FIEL...

dodcio.defense.gov

#development #pdf

0 1646 0

2022-01-27 14:32:09 UTC

New

General Dev>In The News

There’s No Such Thing as Clean Code

Everyone seems to be striving for ‘clean’ code at the moment. You can’t read a blog post without the author telling you how clean their a...

steveonstuff.com

#code

31 1262 9

2022-03-28 00:29:57 UTC

New

General Dev>In The News

Helix, a Kakoune inspired Vim-model text editor (written in Rust)

Yet another rust-made text editor, though I’m really liking the looks of how this one works!

/rust

5 2078 1

2022-03-30 14:44:03 UTC

New

General Dev>In The News

Quick Start Guide for Flipper Zero

Flipper Zero is a portable multi-tool for pentesters and geeks in a toy-like body. It loves hacking digital stuff, such as radio protocol...

blog.flipperzero.one

#guide

0 1129 0

2022-05-15 13:56:21 UTC

New

General Dev>In The News

Whatever happened to Elm, anyway?

Whatever happened to Elm, anyway?. I see this question pop up quite frequently in lots of different arenas - folks are curious as to wha...

derw.substack.com

/elm

17 1013 12

2025-04-21 03:57:49 UTC

New

General Dev>In The News

Why I like Clojure as a solo developer | Biff

Why I like Clojure as a solo developer | Biff. Most of the reasons fall into a few categories: data orientation, the JVM, and the REPL.

biffweb.com

/clojure

2 1071 2

2023-04-23 01:18:47 UTC

New

General Dev>In The News

SLUM: The Shadow Library Uptime Monitor

SLUM: The Shadow Library Uptime Monitor. This dashboard tracks the availability of popular shadow libraries in real time from a US-based...

open-slum.org

#library #monitor

0 648 0

2025-01-19 20:46:27 UTC

New

General Dev>In The News

olmOCR – Open-Source OCR for Accurate Document Conversion

olmOCR is an open-source tool for converting PDFs to text with high accuracy, preserving reading order and supporting tables, equations, ...

olmocr.allenai.org

2 412 1

2025-03-09 05:08:33 UTC

New

General Dev>In The News

JavaScript Fatigue Strikes Back

The new frameworks will continue until morale improves.

allenpike.com

/js

6 274 5

2025-03-24 16:52:46 UTC

New

Other popular topics

General Dev>Hardware

Which keyboard do you have?

If it’s a mechanical keyboard, which switches do you have? Would you recommend it? Why? What will your next keyboard be? Pics always w...

#hardware /keyboards #sticky #mechanical-keyboards

144 8502 50

2021-01-07 23:58:36 UTC

New

General Dev>Code Editors

SpaceVim vs SpaceMacs

SpaceVim seems to be gaining in features and popularity and I just wondered how it compares with SpaceMacs in 2020 - anyone have any thou...

/vim #spacevim #spacemacs /emacs #code-editors

30 3579 14

2020-08-27 17:53:29 UTC

New

General Dev>Hardware

Poll: Which keyboard layout do you use?

poll poll Be sure to check out @Dusty’s article posted here: An Introduction to Alternative Keyboard Layouts It’s one of the best write-...

colemakmods.github.io

#polls /keyboards

10 5348 11

2020-10-31 23:12:33 UTC

New

General Dev>Code Editors

Dendron: a personal knowledge management tool on top of VSCode

/vscode #visual-studio-code

28 6034 9

2021-05-05 12:15:29 UTC

New

Community>In The Spotlight

Spotlight: Noel Rappin (Author)

“Finding the Boundaries” Hero’s Journey with Noel Rappin @noelrappin Even when you’re ultimately right about what the future ho...

#author-spotlight #web-development /rails /book-modern-front-end-development-for-rails /book-rails-5-test-prescriptions

34 3841 21

2021-02-11 12:34:07 UTC

New

Backend>Questions

Please tell me how to write a query for this in nodejs

API 4 Path: /user/following/ Method: GET Description: Returns the list of all names of people whom the user follows Response [ { ...

/nodejs

7 3059 3

2021-06-23 23:49:49 UTC

New

Science/Tech>Health & Diet

David Sinclair's new Lifespan podcast

We’ve talked about his book briefly here but it is quickly becoming obsolete - so he’s decided to create a series of 7 podcasts, the firs...

#health #podcasts #bio-hackers #david-sinclair

87 6021 49

2022-04-12 16:27:36 UTC

New

Community>In The Spotlight

Spotlight: Mike Riley (Author) Interview and AMA!

Author Spotlight Mike Riley @mriley This month, we turn the spotlight on Mike Riley, author of Portable Python Projects. Mike’s book ...

#author-spotlight /python #iot /book-portable-python-projects #internet-of-things

62 6351 19

2022-06-09 14:01:01 UTC

New

Android>Questions

Unresolved Reference to android in build.gradle.kts – Beginner Issue

Hello, I’m a beginner in Android development and I’m facing an issue with my project setup. In my build.gradle.kts file, I have the foll...

#binding

0 2183 2

2024-12-09 21:07:33 UTC

New

AI>In The News

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...

#ai #macs /deepseek

0 3570 1

2025-01-29 18:43:37 UTC

New

General Dev>In The News

2000 words about arrays and tables

General Dev>In The News

Optician Sans – Free font based on historical optotypes

General Dev>In The News

The Hype is the Product

General Dev>In The News

Writing memory efficient C structs

General Dev>In The News

Opsqueue: lightweight batch processing queue for heavy loads

General Dev>In The News

The Secret Stanford Program No One's Heard About

General Dev>In The News

The UK is slogging through an online age-gate apocalypse

General Dev>In The News

Weather Model based on ADS-B

General Dev>In The News

Software Development at 800 Words Per Minute | Dickson Tan's blog

General Dev>In The News

How I hacked my washing machine - Nex's Blog

General Dev>In The News

General Dev In The News ❯

Latest on Devtalk

TypeScript v5.9.2 released!

Frontend>Official News

Tutorial Deploy Phoenix 1.8 with Coolify on Hetzner

Backend>Blogs/Talks

Djangonaut Space is looking for contributors to be mentors

Backend>Official News

PostgreSQL: CloudNativePG 1.26.1, 1.25.3 and 1.27.0-rc1 Released!

Backend>Official News

Symfony v7.3.2, v7.2.9 and v6.4.24 released!

Backend>Official News

Gleam v1.12.0-rc3 released!

Backend>Official News

Deno v2.4.3 released!

Frontend>Official News

2000 words about arrays and tables

General Dev>In The News

Optician Sans – Free font based on historical optotypes

General Dev>In The News

Crush: The glamourous AI coding agent for your favourite terminal 💘

AI>In The News

The Hype is the Product

General Dev>In The News

New Debian Developers and Maintainers (May and June 2025)

Linux>Official News

A Python dict that can report which keys you did not use - Peterbe.com

Backend>In The News

The HTML Hobbyist

Frontend>In The News

Writing memory efficient C structs

General Dev>In The News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Benchmarking Neural Network Training Algorithms

CommunityNews

Benchmarking Neural Network Training Algorithms

Where Next?

Popular General Dev topics

SkiftOS: Simple, handmade operating system for the x86 platform

DOD: Guidance on Software Development and Open Source Software (pdf)

There’s No Such Thing as Clean Code

Helix, a Kakoune inspired Vim-model text editor (written in Rust)

Quick Start Guide for Flipper Zero

Whatever happened to Elm, anyway?

Why I like Clojure as a solo developer | Biff

SLUM: The Shadow Library Uptime Monitor

olmOCR – Open-Source OCR for Accurate Document Conversion

JavaScript Fatigue Strikes Back

Other popular topics

Which keyboard do you have?

SpaceVim vs SpaceMacs

Poll: Which keyboard layout do you use?

Dendron: a personal knowledge management tool on top of VSCode

Spotlight: Noel Rappin (Author)

Please tell me how to write a query for this in nodejs

David Sinclair's new Lifespan podcast

Spotlight: Mike Riley (Author) Interview and AMA!

Unresolved Reference to android in build.gradle.kts – Beginner Issue

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

Sponsor Spotlight

General Dev>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta