CommunityNews

CommunityNews

Lessons Learned from Two Years as a Data Scientist

Intro

I finally escaped from (grad) school in 2019, spent two months interning as an assistant trader at FTX, and have since spent the last two years working as a data scientist, with the bulk of that time at Microsoft and the last two months at Anthropic. Not gonna lie — my time as a data scientist has been pretty awesome. I was technically on a product team at Microsoft in DevDiv working on improving developer tools like VS Code, but in practice it was like a flush research job with tons of freedom (to first approximation my instructions for the first six months were, “Fix bugs using machine learning. Go.”) I got to work with cutting edge transformer models and their application to source code, the most famous of which is autocompletion ala intellisense, tabnine, and Codex. Every two months the amount of compute I have access to has doubled, starting out with a 2013 macbook air and ending with a cluster with thousands of A100s (which, to be fair, I share with more people than my old laptop), and I got to collaborate on ten different papers.

I was extremely green when I started out. I hadn’t used python since cs1 as a college freshman, excluding one month where I worked through fast.ai and leetcode. Maybe the most telling anecdote is that I spent an embarrassingly long time using vim as my exclusive file editor… while not knowing any of the vim shortcuts besides :wq (write and quit).

Read in full here:

https://dawndrain.github.io/braindrain/two_years.html

This thread was posted by one of our members via one of our news source trackers.

First Post!

bot

bot

Corresponding tweet for this thread:

Share link for this tweet.

Popular Data Science topics Top

PragmaticBookshelf
Self-driving cars, natural language recognition, and online recommendation engines are all possible thanks to Machine Learning. Discover...
New
AstonJ
Probably only a delay of the inevitable - with facial-recognition systems already part of daily lives in countries such as China, how lon...
New
AstonJ
Is this the future of photography - or spying even :joy: Creating realistic 3D photo-models from a series of photographs - in this case ...
New
New
CommunityNews
Data. It’s everywhere and we’re only getting more of it. For the last 5-10 years, data science has attracted newcomers near and far tryin...
New
First poster: bot
Summary In this project, we Added an OpenGL backend for MXNet/TVM - a general-purpose tensor computation framework, so that it automat...
New
First poster: davearonson
Deep learning may transform health care, but model development has largely been dependent on availability of advanced technical expertise...
New
CommunityNews
“Markpainting” is a clever technique to watermark photos in such a way that makes it easier to detect ML-based manipulation: An image o...
New
First poster: bot
The Modern Mathematics of Deep Learning. We describe the new field of mathematical analysis of deep learning. This field emerged around...
New
First poster: bot
Intro I finally escaped from (grad) school in 2019, spent two months interning as an assistant trader at FTX, and have since spent the la...
New

Other popular topics Top

Devtalk
Reading something? Working on something? Planning something? Changing jobs even!? If you’re up for sharing, please let us know what you’...
1017 16952 374
New
axelson
I’ve been really enjoying obsidian.md: It is very snappy (even though it is based on Electron). I love that it is all local by defaul...
New
Exadra37
Oh just spent so much time on this to discover now that RancherOS is in end of life but Rancher is refusing to mark the Github repo as su...
New
New
wmnnd
Here’s the story how one of the world’s first production deployments of LiveView came to be - and how trying to improve it almost caused ...
New
rustkas
Intensively researching Erlang books and additional resources on it, I have found that the topic of using Regular Expressions is either c...
New
PragmaticBookshelf
Build efficient applications that exploit the unique benefits of a pure functional language, learning from an engineer who uses Haskell t...
New
PragmaticBookshelf
Author Spotlight Jamis Buck @jamis This month, we have the pleasure of spotlighting author Jamis Buck, who has written Mazes for Prog...
New
PragmaticBookshelf
Author Spotlight: Tammy Coron @Paradox927 Gaming, and writing games in particular, is about passion, vision, experience, and immersio...
New
AstonJ
This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...
New