How to learn about a new domain as a data scientist
parachute.png
“The best thing about being a statistician is that you get to play in everyone’s backyard.”
John Tukey
As a data scientist, your skill set is powerful because it is very generally applicable. ML models and statistics give us ways to use data to understand the world and make decisions a little better, regardless of the domain
Despite this generality, YOU NEED TO UNDERSTAND THE DOMAIN TO DO A GOOD JOB. I’ve developed a method that I use to do this
Over the last ten years, I’ve had a lot of jobs, and a lot of teams. I’ve been lucky! I’ve been able to work with so many interesting people, solving lots of interesting problems . even if you don’t yet have my ~level of jaded frustration~ wealth of experience, maybe you:
this post is meant to help you X. it is idiosyncratic, and reflects what has worked for me (it’s what I used most recently to onboard at google)
it has three major components, performed roughly in order:
Talk talk talk talk talk!! you can’t overdo this
Each of these includes themes that I try and piece together, which are included in bold. Each theme includes some specific questions you might ask. I try to cover all the themes, but I don’t necessarily ask each question (and I may ask others besides).
There are three big things:
I’ve written this for organizations which use the PM-Analyst-Eng structure. your org may have only a subset, or other ones besides, or something else
Titles like: DS, Analyst, AE, Product Analyst
Titles like: Product Manager, Marketer, Designer
sentence
Titles like: Software Engineer, Release Engineer, Eng manager
sentence
You should have some examples of: entities interactions value etc
The goal
1
2
3
Wrap up exercise
The goal
1
2
3
Wrap up exercise
https://bpb-us-w2.wpmucdn.com/sites.umassd.edu/dist/8/754/files/2019/01/Deconstructing-Statistical-Questions.pdf
chris chatfield