CommunityNews

CommunityNews

Using OpenGL instead of CUDA for machine learning

Summary

In this project, we

  1. Added an OpenGL backend for MXNet/TVM - a general-purpose tensor computation framework, so that it automatically compiles a Python program into an OpenGL shader that runs on the GPU on a computer that does not have CUDA.

  2. Explored optimizations of OpenGL shader programs so that a fundamental computation task needed in machine learning - matrix multiplication - has comparable performance with OpenCL on the same machine.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.

Where Next?

Popular Other Fields topics Top

New
First poster: bot
Adversarial.io is an easy-to-use webapp for altering image material, in order to make it machine-unreadable. It works best with 299 x 29...
New
First poster: davearonson
Deep learning may transform health care, but model development has largely been dependent on availability of advanced technical expertise...
New
First poster: bot
The Modern Mathematics of Deep Learning. We describe the new field of mathematical analysis of deep learning. This field emerged around...
New
First poster: bot
The Future of Deep Learning Is Photonic. Computing with light could slash the energy needs of neural networks
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
ManningBooks
The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of rel...
New
ManningBooks
With Grokking Statistics, you’ll build a strong foundation in statistical analysis by working through engaging mini projects that put eac...
New
ManningBooks
DAX Reimagined isn’t just another beginner’s guide to the powerful DAX language. This unique book teaches you how to work with the engine...
New
ManningBooks
Timeless Algorithms: The Seminal Papers explains both the how and the why of the most important data science algorithms. Along with the t...
New

Other popular topics Top

Devtalk
Reading something? Working on something? Planning something? Changing jobs even!? If you’re up for sharing, please let us know what you’...
1052 21915 398
New
PragmaticBookshelf
Stop developing web apps with yesterday’s tools. Today, developers are increasingly adopting Clojure as a web-development platform. See f...
New
PragmaticBookshelf
Brace yourself for a fun challenge: build a photorealistic 3D renderer from scratch! In just a couple of weeks, build a ray tracer that r...
New
DevotionGeo
I know that these benchmarks might not be the exact picture of real-world scenario, but still I expect a Rust web framework performing a ...
New
New
AstonJ
We’ve talked about his book briefly here but it is quickly becoming obsolete - so he’s decided to create a series of 7 podcasts, the firs...
New
PragmaticBookshelf
Author Spotlight Mike Riley @mriley This month, we turn the spotlight on Mike Riley, author of Portable Python Projects. Mike’s book ...
New
PragmaticBookshelf
Get the comprehensive, insider information you need for Rails 8 with the new edition of this award-winning classic. Sam Ruby @rubys ...
New
AstonJ
This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...
New
AstonJ
Curious what kind of results others are getting, I think actually prefer the 7B model to the 32B model, not only is it faster but the qua...
New