Thursday, January 2, 2020

Data Science Job Market

A few weeks ago I was thinking about the state of the current Data Science job market--and a bit frustrated about inquiries by job seekers on LinkedIn, I hastily sent out a tweet on my thoughts..

As the weeks have gone by (and my employment situation has become a bit weird--in good ways), I've often thought about that tweet.  Essentially, since the day I said this, I've noticed it to be more true than I originally thought--Especially on the Shingy front.

I thought it would might good to put together some more in-depth thoughts on the different types of Data Science candidates I see on the market and how they relate to real roles in companies. In result, this blog can serve as a helpful guide in building a data science team at your organization.  I've gone into more detail of the five types of candidates below.

Directly Out of School

The trademark of these candidates are recently completing grad school with no or limited work experience.  A few resumes will have a history of internships, or will try to pass off class projects as "jobs they worked." My general thoughts:
  • There are a lot of these candidates-a lot.
  • They harass Data Science managers regularly on social media, LinkedIn and email.
  • Many of them are not yet talented, and are going to take a lot of work before they're tackling projects on their own.
  • Some of them have not been screened for what I call "employability" and are unlikely to survive the rigor, rules and norms of the workplace.
  • Be careful of buzzword slingers.
Advice: You can take on a few of these and they can add value for your team over time.  But you can and should screen heavily before choosing your candidate.  The sheer number of these candidates on the market gives the employer the luxury of being picky-and because many of these candidates come untested and without serious references (professors are not real references) you need to use your best BS detecting.


Couple of Years Experience

These candidates have been out in the job market for one to five years and generally with one or two employers.  Resumes will generally have real projects worked on listed, though being a junior-level employee, you don't know what their actual role involved.  My thoughts on this group:
  • There are quite a few in this group as well.
  • The loudest ones are usually in the process of failing out of a first job.
  • A lot of them are well on their way to a great career, though.
  • Their skills may be largely defined by the experience of their first job-so there will still be a big training job ahead. 
  • It is very important to determine *exactly* what their roles are on teams and projects, beware of Tableau jockeys and spreadsheet analysts pumping their resume.
Advice: These employees are generally a great investment, though you have to be very careful not to be picking up another organization's castoffs.  They can begin to take on larger projects on their own, or serve as junior mentors to the first category of employees.  This group should be seen as your bridge the future Data Science team, the group that will be your Senior and Principal data scientists within 2-5 years.


Shingy Clones

First, who's shingy?  This guy.





These candidates are full of hot air, very much hyped on Data Science as a concept--also other hyped concepts--have fun getting them to talk about block chain.  The dark side of course is that they have no Data Science abilities and are just low-rent hype people
  • They will come with a lot of energy and enthusiasm, which can be hypnotizing, especially for executives.
  • These people are the definition of why the interview process is critical.
  • They are completely destructive if you hire one, will always be hyping and saying we need new technology or to do "x".  However they don't have the skills or knowledge to understand what they are suggesting or how to deliver.
  • They have no clue what they are talking about.
Advice: You can't hire these people.  They are going to be a high cost with zero deliverables.  To avoid this put some matrix algebra or simple calculus questions on your interview.   Coefficient interpretation? Ask them questions about coding in un-sexy languages (SQL).  On difficult questions this group will break.  

(As an aside, I've had a few of these people try to gaslight me, and then end up yelling at me in an interview.  It's not fun to be yelled at, but when this happens, I know I've dodged a bullet in calling someone out.)


Experienced Statisticians

These candidates are more advanced in their career, and often will shun the term Data Scientist.  They may be less striking at first, and certainly with less flash than a 25-year-old machine learning expert, but add a ton of value to your organization.
  • These candidates generally build models, often outperforming data scientists models while using simpler, more elegant methods.
  • They can be great mentors to young data scientists-if junior staff are willing to listen.
  • They often lack machine learning or big data system (e.g. Hadoop).
  • They will also lack some more modern coding/computing skills (e.g. containerization, cloud, etc).
  • One successful tactic is to use this type of employee on a project team with a machine learning expert and a data engineer.  The data engineer will bring the technical coding skills, the machine learning expert will bring modern methods, and the more experienced employee will bring research design and rigor.
Advice: These candidates are some of the best deals on the market, mainly because they can mentor and "fix" a lot of the missing knowledge of young data scientists.  Younger data scientists tend to have bad habits, or in cases massive holes in their skillsets and intuition around research design, rigor, probability, and statistics.  And more simply put-these employees often can do better work, using older methods and are great mentors.


Unicorns

These are the classic Data Scientists that many organizations are looking for.  Their traits:
  • 15+ years in building machine learning models.
  • 15+ years building econometric models.
  • Production level developer who has built massive productionized ML systems
  • Hadoop/Spark developer.
  • 100x ROI.
  • Virtually non-existent.
I'm being a bit hyperbolic, but these candidates essentially don't exist.  Well, some do, but you may not want to pay the premium involved (it's high).  If you can find one, by all means hire.  On the other hand, you can build an all-star team fairly well by focusing on building blocks.

Summary

A few months ago a recruiter called me and asked if I had 15 years of experience in Hadoop.  This is an absurd question given that Hadoop's first release was in 2006, but it speaks to an underlying truth:  many organizations are looking for a Data Science candidate pool that simply does not exist.  I hope that the essential takeaway allows you to build a reasonable Data Science team with building blocks based on the talent actually available. A reasonable team might involve:
  • Directly out of School: 1-2 FTE
  • Couple Years Experience: 2 FTE
  • Shingy Clones: 0 FTE
  • Experienced Statisticians: 1 FTE
  • Unicorns: 1 - if you can find one, but not necessary
As an aside, the model without "Unicorn" candidates will likely require substantial help from data engineers and some developers in order to get data into systems, and models into production.  This does create some inefficiencies, but is often less expensive than finding a unicorn candidate.  

28 comments:

  1. Good advice on data science teams. I've found its also helpful to have a few other support members on the team or at least available to help out. A visualization expert, that is someone who can build charts and powerpoint slides to present to management or the client. Plus someone who can translate data science into English for managers to understand. Its the managers that control the purse strings. If a data scientist can do these functions great, but do you rally want your high price data scientist building powerpoint slides or building ML models.

    ReplyDelete
  2. This comment has been removed by a blog administrator.

    ReplyDelete
    Replies
    1. This comment has been removed by the author.

      Delete
  3. Very good and informative article. Thanks for sharing such nice article, keep on updating such good articles.
    Adobe photoshop training Institute in Chennai | Adobe photoshop training Institute in Madipakkam

    ReplyDelete
  4. Very good and informative article. Thanks for sharing such nice article, keep on updating such good articles.
    MS Office training in Chennai | MS Office training in Madipakkam

    ReplyDelete
  5. Very good and informative article. Thanks for sharing such nice article, keep on updating such good articles.
    Basic Computer Course in Chromepet | Basic Computer Course in Chennai

    ReplyDelete
  6. Awesome post. Really you are shared very informative concept... Thank you for sharing. Keep on updating...
    MCSA exam center in Chennai | MCSA exam center in pallikaranai

    ReplyDelete
  7. Awesome post. Really you are shared very informative concept... Thank you for sharing. Keep on updating...
    UI Path training center in Chennai | UI Path training center in pallikaranai

    ReplyDelete
  8. Very good and informative article. Thanks for sharing such nice article, keep on updating such good articles.
    Adobe Photoshop training center in Chennai | Adobe Photoshop training center in Velachery

    ReplyDelete
  9. Very good and informative article. Thanks for sharing such nice article, keep on updating such good articles.
    MS.Excel Training center in Chennai | | MS.Excel Training center in Velachery

    ReplyDelete
  10. This comment has been removed by a blog administrator.

    ReplyDelete
  11. Wow, this is fascinating reading. I am glad I found this and got to read it. Great job on this content. I liked it a lot. Thanks for the great and unique info. job posting site

    ReplyDelete
  12. Hello, this weekend is good for me, since this time i am reading this enormous informative article here at my home. install tensorflow anaconda

    ReplyDelete
  13. Can Forex Trading Make You Rich?

    Forex Duality Tradeology is a digital product​ that helps people interested in forex trading to understand its basics and learn how to use the trading software. Forex scalping is a day trading technique where Forex trader executes a trade and exit within minutes or seconds on some cases. You'll want to leave your PC on with MetaTrader 4 open and running 24 hours a day, five days a week. The type of currency which is being spent is the base currency while the one that is being bought is the quote currency; in forex trading one currency is sold to purchase another one. Tradeology - forex duality review. 4. Trading live with a like-minded group of other traders, conversing and sharing trade strategies and calls.

    Some governments of emerging markets do not allow foreign exchange derivative products on their exchanges because they have capital controls The use of derivatives is growing in many emerging economies. So irrespective of the event that has triggered the movement - forex traders do not care. If you normally trade a mini lot (10,00 units of a currency), use a micro lot (1,000 units) instead, because the price differences for trades on a weekly scale can be significantly greater than when trading over shorter time periods.

    This allows traders to trade in multiple lots. The system also comes with well defined guidelines on how to manage the trade so that a trader can have maximum profits. Discussing Nicola Delic and Nicola Delic foreign exchange experience can leave you astonished. Whenever you are traveling abroad or you are exchanging your currency or receiving or transferring money abroad, then you need to make the forex transaction. That's why they say that all fortunate traders make money in different ways, but all failing traders lose it the same way. https://www.reviewengin.com/tips-to-write-eye-catching-headline-for-blog/

    ReplyDelete
  14. I am impressed by the information that you have on this blog. It shows how well you understand this subject.
    data analytics course
    big data analytics malaysia
    big data course

    ReplyDelete
  15. I can see that you are an expert at your field! I am launching a website soon, and your information will be very useful for me.. Thanks for all your help and wishing you all the success in your business.
    PMP Certification
    PMP Certification in Malaysia
    PMP Course
    PMP Course in Malaysia

    ReplyDelete
  16. I will be interested in more similar topics. i see you got really very useful topics , i will be always checking your blog thanks
    data science course

    ReplyDelete
  17. I have read a few of the articles on your website now, and I really like your style of blogging. I added it to my favorites blog site list and will be checking back soon. Please check out my site as well and let me know what you think. how to buy instagram likes cheap

    ReplyDelete
  18. Easily, the article is actually the best topic on this registry related issue. I fit in with your conclusions and will eagerly look forward to your next updates.
    PMP Certification
    PMP Course
    PMP Course in Malaysia
    PMP Training in Malaysia
    PMP Training

    ReplyDelete
  19. Awesome blog. I enjoyed reading your articles. This is truly a great read for me. I have bookmarked it and I am looking forward to reading new articles. Keep up the good work!
    data analytics course
    big data analytics malaysia
    big data course

    ReplyDelete
  20. It is extremely nice to see the greatest details presented in an easy and understanding manner.data science course

    ReplyDelete
  21. It is perfect time to make some plans for the future and it is time to be happy. I've read this post and if I could I desire to suggest you some interesting things or suggestions. Perhaps you could write next articles referring to this article. I want to read more things about it! PMP Certification 360DigiTMG
    PMP Course 360DigiTMG
    PMP Course in Malaysia 360DigiTMG
    PMP Training 360DigiTMG
    PMP Training in Malaysia 360DigiTMG

    ReplyDelete
  22. Awesome blog. I enjoyed reading your articles. This is truly a great read for me. I have bookmarked it and I am looking forward to reading new articles. Keep up the good work!
    data analytics course
    big data analytics malaysia
    big data course

    ReplyDelete
  23. Great Article it its really informative and innovative keep us posted with new updates. its was really valuable. thanks a lot.data science course

    ReplyDelete
  24. Nicely explaned thanks for sharing..keep posting such a amazing post.
    MOVIES4YOU
    TE3N Full movie Download

    ReplyDelete