CommunityNews

CommunityNews

Lessons Learned from Two Years as a Data Scientist

Intro

I finally escaped from (grad) school in 2019, spent two months interning as an assistant trader at FTX, and have since spent the last two years working as a data scientist, with the bulk of that time at Microsoft and the last two months at Anthropic. Not gonna lie — my time as a data scientist has been pretty awesome. I was technically on a product team at Microsoft in DevDiv working on improving developer tools like VS Code, but in practice it was like a flush research job with tons of freedom (to first approximation my instructions for the first six months were, “Fix bugs using machine learning. Go.”) I got to work with cutting edge transformer models and their application to source code, the most famous of which is autocompletion ala intellisense, tabnine, and Codex. Every two months the amount of compute I have access to has doubled, starting out with a 2013 macbook air and ending with a cluster with thousands of A100s (which, to be fair, I share with more people than my old laptop), and I got to collaborate on ten different papers.

I was extremely green when I started out. I hadn’t used python since cs1 as a college freshman, excluding one month where I worked through fast.ai and leetcode. Maybe the most telling anecdote is that I spent an embarrassingly long time using vim as my exclusive file editor… while not knowing any of the vim shortcuts besides :wq (write and quit).

Read in full here:

https://dawndrain.github.io/braindrain/two_years.html

This thread was posted by one of our members via one of our news source trackers.

Where Next?

Popular Other Fields topics Top

New
CommunityNews
Data. It’s everywhere and we’re only getting more of it. For the last 5-10 years, data science has attracted newcomers near and far tryin...
New
AstonJ
Just listening now… details revealed in Thinking Elixir’s podcast: José Valim visits and finally publicly reveals what Project Nx is! H...
New
First poster: bot
Adversarial.io is an easy-to-use webapp for altering image material, in order to make it machine-unreadable. It works best with 299 x 29...
New
First poster: bot
What is Logica? Logica is an open source declarative logic programming language for data manipulation. Logica is a successor to Yedalog, ...
New
CommunityNews
“Markpainting” is a clever technique to watermark photos in such a way that makes it easier to detect ML-based manipulation: An image o...
New
First poster: bot
Intro I finally escaped from (grad) school in 2019, spent two months interning as an assistant trader at FTX, and have since spent the la...
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
ManningBooks
The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of rel...
New
ManningBooks
With Grokking Statistics, you’ll build a strong foundation in statistical analysis by working through engaging mini projects that put eac...
New

Other popular topics Top

AstonJ
What chair do you have while working… and why? Is there a ‘best’ type of chair or working position for developers?
New
wolf4earth
@AstonJ prompted me to open this topic after I mentioned in the lockdown thread how I started to do a lot more for my fitness. https://f...
New
Exadra37
Please tell us what is your preferred monitor setup for programming(not gaming) and why you have chosen it. Does your monitor have eye p...
New
siddhant3030
I’m thinking of buying a monitor that I can rotate to use as a vertical monitor? Also, I want to know if someone is using it for program...
New
dimitarvp
Small essay with thoughts on macOS vs. Linux: I know @Exadra37 is just waiting around the corner to scream at me “I TOLD YOU SO!!!” but I...
New
Exadra37
I am asking for any distro that only has the bare-bones to be able to get a shell in the server and then just install the packages as we ...
New
AstonJ
In case anyone else is wondering why Ruby 3 doesn’t show when you do asdf list-all ruby :man_facepalming: do this first: asdf plugin-upd...
New
PragmaticBookshelf
Build highly interactive applications without ever leaving Elixir, the way the experts do. Let LiveView take care of performance, scalabi...
New
DevotionGeo
The V Programming Language Simple language for building maintainable programs V is already mentioned couple of times in the forum, but I...
New
Margaret
Ask Me Anything with Mark Volkmann @mvolkmann On February 24 and 25, we are giving you a chance to ask questions of PragProg author M...
New