Thinking about candidates in the #DataScience job market:— Levi Bowles (@LeviABx) October 22, 2019
40% just out of school, no real experience/skills.
40% 1-2 years out of school, very limited experience/skills.
15% David Shing hype clones.
4% experienced, good, but incomplete skills.
1% unicorn candidates.
As the weeks have gone by (and my employment situation has become a bit weird--in good ways), I've often thought about that tweet. Essentially, since the day I said this, I've noticed it to be more true than I originally thought--Especially on the Shingy front.
I thought it would might good to put together some more in-depth thoughts on the different types of Data Science candidates I see on the market and how they relate to real roles in companies. In result, this blog can serve as a helpful guide in building a data science team at your organization. I've gone into more detail of the five types of candidates below.
Directly Out of School
The trademark of these candidates are recently completing grad school with no or limited work experience. A few resumes will have a history of internships, or will try to pass off class projects as "jobs they worked." My general thoughts:- There are a lot of these candidates-a lot.
- They harass Data Science managers regularly on social media, LinkedIn and email.
- Many of them are not yet talented, and are going to take a lot of work before they're tackling projects on their own.
- Some of them have not been screened for what I call "employability" and are unlikely to survive the rigor, rules and norms of the workplace.
- Be careful of buzzword slingers.
Couple of Years Experience
These candidates have been out in the job market for one to five years and generally with one or two employers. Resumes will generally have real projects worked on listed, though being a junior-level employee, you don't know what their actual role involved. My thoughts on this group:- There are quite a few in this group as well.
- The loudest ones are usually in the process of failing out of a first job.
- A lot of them are well on their way to a great career, though.
- Their skills may be largely defined by the experience of their first job-so there will still be a big training job ahead.
- It is very important to determine *exactly* what their roles are on teams and projects, beware of Tableau jockeys and spreadsheet analysts pumping their resume.
Shingy Clones
First, who's shingy? This guy.These candidates are full of hot air, very much hyped on Data Science as a concept--also other hyped concepts--have fun getting them to talk about block chain. The dark side of course is that they have no Data Science abilities and are just low-rent hype people
- They will come with a lot of energy and enthusiasm, which can be hypnotizing, especially for executives.
- These people are the definition of why the interview process is critical.
- They are completely destructive if you hire one, will always be hyping and saying we need new technology or to do "x". However they don't have the skills or knowledge to understand what they are suggesting or how to deliver.
- They have no clue what they are talking about.
Advice: You can't hire these people. They are going to be a high cost with zero deliverables. To avoid this put some matrix algebra or simple calculus questions on your interview. Coefficient interpretation? Ask them questions about coding in un-sexy languages (SQL). On difficult questions this group will break.
(As an aside, I've had a few of these people try to gaslight me, and then end up yelling at me in an interview. It's not fun to be yelled at, but when this happens, I know I've dodged a bullet in calling someone out.)
Experienced Statisticians
These candidates are more advanced in their career, and often will shun the term Data Scientist. They may be less striking at first, and certainly with less flash than a 25-year-old machine learning expert, but add a ton of value to your organization.- These candidates generally build models, often outperforming data scientists models while using simpler, more elegant methods.
- They can be great mentors to young data scientists-if junior staff are willing to listen.
- They often lack machine learning or big data system (e.g. Hadoop).
- They will also lack some more modern coding/computing skills (e.g. containerization, cloud, etc).
- One successful tactic is to use this type of employee on a project team with a machine learning expert and a data engineer. The data engineer will bring the technical coding skills, the machine learning expert will bring modern methods, and the more experienced employee will bring research design and rigor.
Unicorns
These are the classic Data Scientists that many organizations are looking for. Their traits:- 15+ years in building machine learning models.
- 15+ years building econometric models.
- Production level developer who has built massive productionized ML systems
- Hadoop/Spark developer.
- 100x ROI.
- Virtually non-existent.
Summary
A few months ago a recruiter called me and asked if I had 15 years of experience in Hadoop. This is an absurd question given that Hadoop's first release was in 2006, but it speaks to an underlying truth: many organizations are looking for a Data Science candidate pool that simply does not exist. I hope that the essential takeaway allows you to build a reasonable Data Science team with building blocks based on the talent actually available. A reasonable team might involve:- Directly out of School: 1-2 FTE
- Couple Years Experience: 2 FTE
- Shingy Clones: 0 FTE
- Experienced Statisticians: 1 FTE
- Unicorns: 1 - if you can find one, but not necessary
As an aside, the model without "Unicorn" candidates will likely require substantial help from data engineers and some developers in order to get data into systems, and models into production. This does create some inefficiencies, but is often less expensive than finding a unicorn candidate.
Good advice on data science teams. I've found its also helpful to have a few other support members on the team or at least available to help out. A visualization expert, that is someone who can build charts and powerpoint slides to present to management or the client. Plus someone who can translate data science into English for managers to understand. Its the managers that control the purse strings. If a data scientist can do these functions great, but do you rally want your high price data scientist building powerpoint slides or building ML models.
ReplyDeleteMachine Learning Projects for Final Year machine learning projects for final year
DeleteDeep Learning Projects assist final year students with improving your applied Deep Learning skills rapidly while allowing you to investigate an intriguing point. Furthermore, you can include Deep Learning projects for final year into your portfolio, making it simpler to get a vocation, discover cool profession openings, and Deep Learning Projects for Final Year even arrange a more significant compensation.
Python Training in Chennai Python Training in Chennai Angular Training Project Centers in Chennai
Hi, Thanks for sharing nice articles...
ReplyDeleteData Science Training in Hyderabad
Interesting post. I Have Been wondering about this issue, so thanks for posting. Pretty cool post.It 's really very nice and Useful post.Thanks
ReplyDeletebusiness analytics course
Here is the site(bcomexamresult.in) where you get all Bcom Exam Results. This site helps to clear your all query.
ReplyDeleteRMLAU BCOM 3rd Year Result 2020
BA 3rd year Result 2019-20
Sdsuv University B.COM 3rd/HONOURS Sem Exam Result 2018-2021
This is quite charming post you shared, I like the post, an obligation of appreciation is all together for sharing..
ReplyDelete360DigiTMG pmp certification
This is quite charming post you shared, I like the post, an obligation of appreciation is all together for sharing..
ReplyDeletePMP certification
I need to communicate my deference of your composing aptitude and capacity to make perusers read from the earliest starting point as far as possible. I might want to peruse more up to date presents and on share my musings with you.
ReplyDeletehttps://360digitmg.com/course/data-analytics-using-python-r
Truly overall quite fascinating post. I was searching for this sort of data and delighted in perusing this one. Continue posting. Much obliged for sharing.business analytics training
ReplyDeleteI need to communicate my deference of your composing aptitude and capacity to make perusers read from the earliest starting point as far as possible. I might want to peruse more up to date presents and on share my musings with you.
ReplyDelete360DigiTMG data analytics courses
ReplyDeleteI'm cheerful I found this blog! Every now and then, understudies need to psychologically the keys of beneficial artistic articles forming. Your information about this great post can turn into a reason for such individuals.
https://360digitmg.com/course/project-management-professional-pmp
I need to communicate my deference of your composing aptitude and capacity to make perusers read from the earliest starting point as far as possible. I might want to peruse more up to date presents and on share my musings with you.
ReplyDeletedata analytics courses
You really make it look so natural with your exhibition however I see this issue as really something which I figure I could never understand. It appears to be excessively entangled and incredibly expansive for me.
ReplyDeleteproject management certification
I think I have never watched such online diaries ever that has absolute things with all nuances which I need. So thoughtfully update this ever for us.
ReplyDeletebig data course
I think I have never watched such online diaries ever that has absolute things with all nuances which I need. So thoughtfully update this ever for us.
ReplyDeletehttps://360digitmg.com/course/data-analytics-using-python-r
It's late discovering this demonstration. At any rate, it's a thing to be acquainted with that there are such functions exist. I concur with your Blog and I will have returned to assess it more later on so please keep up your demonstration.
ReplyDeletedata scientist course
Python is one of the most popular, general purpose, interpreted and high-level programming language. SOL Technologies Solutions delivers best python training institute in Delhi.
ReplyDeletePython training institute in delhi
Python training Course in delhi
Thanks for Sharing a Very Informative Post & I read Your Article & I must say that is very helpful post for us.
ReplyDeleteOnline Data Science Classes
Selenium Training in Pune
AWS Online Classes
Python Online Classes
Thanks for Sharing This Article.It is very so much valuable content. I hope these Commenting lists will help to my website
ReplyDeletedata science course in gurgaon
nice blog!! i hope you will share a blog on Data Science.
ReplyDeletedata analytics course in yelahanka