John Foreman, Data Scientist
  • Home
  • Data Smart book
  • Speaking & Events
  • Featured Talks
  • Blog
  • MailChimp
Contact

The $30/hr Data Scientist

3/6/2014

11 Comments

 
Yesterday a journalist asked me to comment on Vincent Granville's post about the $30/hr data scientist for hire on Elance. What started as a quick reply in an email, spiraled a bit, so I figured I'd post the entire reply here to get your thoughts in the comments.

When we ask the question, "Can someone do what a data scientist does for $30/hr?" we first need to answer the question, "What does a data scientist do?" And there are a multitude of answers to that question. 

If by data scientist, we mean " a person who can perform a data summary, aggregation or modeling task that has been well-defined for them in advance" then it is by no means a surprise that there are folks who can do this at a $30/hr price point. Indeed, there'll probably come a day where that task can be completed for free by software without the freelancer. This is similar to the evolution of web development freelancing.

The key phrase though is "task that has been well-defined."

The types of data scientists who command large salaries seem to meet two very different definitions than what a freelancer at $30/hr can meet:

1) There's the highly-technical engineer. Someone who is knowledgeable and skilled enough to select the correct tools and infrastructure in the polluted big-data landscape to solve a specific, highly-technical data problem. Often these folks are working on problems that haven't been solved before or if they have there are only a few poorly documented examples. Because these tasks might not even be solvable, they're certainly not "well-defined." A business wouldn't trust important bits of infrastructure to $30/hr.

2) There's the data scientist as communicator/translator. This person is someone who knows data science techniques intimately but whose strength is actually in the nontechnical -- this person thrives on taking an ambiguous business situation and distilling it into a data science solution. Often managers and executives don't know what's possible. They know what problems they have, but they don't know how or even if data science can solve those problems. These folks can't hire someone halfway across the globe at $30/hr to figure that out for them. No, they need someone who's deeply technical but also deeply personable in the office to talk things through with them and guide them.

All of the hype around data science is generating a lot of these articles about automating or replacing the role. But 
I think it's important to realize that just like "doctor," "lawyer," "consultant," "developer," etc. the "data scientist" is more of a spectrum or category than a single role.

A data scientist is not someone putting doors on an automobile in a factory. Some of them might be doing just that, i.e. rote modeling tasks. But not all of them. I believe that MOOCs will excel at teaching up an army of these lower-paid data scientists. And that's great. They'll fill a need. Kinda like the need in the 90s for people with basic COMPTIA certifications and the most basic of Cisco certs.

However, there will always be a place for those who excel at solving ambiguous technological & business problems. And they'll cost more than $30/hr.

11 Comments
Roy
3/6/2014 11:48:16

As a counter point, I'm a "legit" days scientist with a PhD. I did my first data science on a freelancing site. Having no client history on the site, I started around $25/hr and incrementally increased my price to $100+/hr. I think this is a common strategy.

Reply
MasterG
3/12/2014 23:28:31

You would have been much better off, in terms of experience, training and pay, if you had taken on one or two paid internships while you were in graduate school. The pay is better and you could have received valuable mentoring.

Reply
JK
3/29/2014 09:24:23

That's not the issue; the issue is that until you have a reputation on these freelancing sites, it's hard to charge a lot. You build up your profile's history and rating by taking on clients and getting them to rate you. Kind of like eBay's reputation system.

Hezi
3/6/2014 12:34:32

So if I want to belong to the second category of data scientists (aka the specialised), how should I train myself?

Reply
guy
3/13/2014 11:08:09

It's not a simple matter of training yourself. Even a fresh faced phd holding degrees in applied mathematics and computer science isn't really ready to be considered part of #2. That takes seasoning, and several years of it. 10 at least? Not just seasoning, but exposure to a variety of different environments and mentorships. There's a reason why in most sciences a phd is followed by several post-docs.

Reply
Max
3/8/2015 05:22:49

I find that a PhD+ tends to be a contra-indicator of ability in #2. In academia, the incentives are in place to impress an ultra-elite minority, and the peer review process almost seems biased against papers that are readable to normal people.

Experience in the industry would probably help, but really #2 is a skill that has more to do with learning how to teach than learning about science.

If you're someone who liked to take extra time in math classes to explain things to a person who was having difficulty, then #2 probably comes naturally to you. It's about listening to others and caring about their understanding. It's more a human than technical skill, and doesn't require "10 years" in the way the post above me is implying. It takes quite a lot longer, but we've been working on it our whole lives.

Tim
3/12/2014 23:53:16

>>just like "doctor," "lawyer," "consultant," "developer"

Interestingly, Doctor and Lawyer are both protected - you can't just go calling yourself a doctor without a certain background, and legally you can't do certain buisness (e.g. open a clinic) without hiring one. Arguably, this keeps the price up.

Another protected profession is an Actuary. The interesting thing about data science is the legions of people working for insurance and pensions companies, banks, and other boring oldschool places, who have been doing this kind of work with these kinds of techniques for decades.

You could argue that big data (i.e. the vast inflow of data from the web and devices) changes things, but actuaries have been working with unaggregated census data, medical records, or devising credit ratings for years. That data is pretty big all told.

So what is a data scientist?

An interesting thing about doctors is that mostly, they don't need to think that much. Thinking has been systematically removed from the profession. There are algorithms for diagnosis and treatement, hopefully quantitively proved, and you get sued if you don't follow them.

The internet is full of people who go to a doctor, who applies an algorithm which works 99.9% of the time, for years, before finding one who will think enough to realize they have cancer.

The internet is full of people who go to a homeopath, who sounds very convincing, and gives them highly diluted asparagus juice to cure their depression. Then they get better naturally and tell everyone it worked. Or they die.

Data science isn't a protected profession. Unlike actuaries, they are not bound by proccess, compulsary best practice, or educational requirements.

That means that they can be both these things - a rogue doctor seeing the truth unblinded by proccess. Or the charlatan.

Software engineering is like that too.

Reply
Jess
3/18/2014 12:55:45

Yea. Try out your chiropractor and your acupuncturist. Try a colon cleanse. And then go back to the 70 y/o GP that smokes and eats salami everyday.

Reply
Tim
3/18/2014 23:44:35

I'm too dumb to understand what you mean. What did you mean?

MrTonyD
3/8/2015 05:38:27

What he means is that that you have created oversimplified categories. One example is chiropractic/acupuncturist and associated fields you dismiss. Check the World Health Organization for some of their summaries of good studies showing beneficial application areas (which are dismissed by the US AMA for political reasons.)
And this ties into the history of licensing - which has always been sold as a way to reduce competition - and often reduces quality.

Alex
3/30/2014 13:46:53

How do you see the market demand for these "lower-paid data scientists" compared to the ones commanding large salaries?

Reply

Your comment will be posted after it is approved.


Leave a Reply.

    Author

    Hey, I'm John, the data scientist at MailChimp.com.

    This blog is where I put thoughts about doing data science as a profession and the state of the "analytics industry" in general.

    Want to get even dirtier with data? Check out my blog "Analytics Made Skeezy", where math meets meth as fictional drug dealers get schooled in data science.

    Reach out to me on Twitter at @John4man

    Picture
    Click here to buy the most amazing spreadsheet book you've ever read (probably because you've never read one).

    Archives

    January 2015
    July 2014
    June 2014
    May 2014
    March 2014
    February 2014
    January 2014
    November 2013
    October 2013
    September 2013
    August 2013
    July 2013
    May 2013
    February 2013

    Categories

    All
    Advertising
    Big Data
    Data Science
    Machine Learning
    Shamelessly Plugging My Book
    Talent
    Talks

    RSS Feed


✕