MODERN DATA SCIENTIST

Data scientist, the sexiest job of the 21st century, requires a mixture of multidisciplinary skills drawn from the intersection of mathematics, statistics, computer science, communication and business. Finding a data scientist is hard. Finding people who understand who a data scientist is, is equally hard. So here is a little cheat sheet on who the modern data scientist really is.

MATH & STATISTICS
  • Machine learning
  • Statistical modeling
  • Experiment design
  • Bayesian inference
  • Supervised learning: decision trees, random forests, logistic regression
  • Unsupervised learning: clustering, dimensionality reduction
  • Optimization: gradient descent and variants

DOMAIN KNOWLEDGE & SOFT SKILLS
  • Passionate about the business
  • Curious about data
  • Influence without authority
  • Hacker mindset
  • Problem solver
  • Strategic, proactive, creative, innovative and collaborative

PROGRAMMING & DATABASE
  • Computer science fundamentals
  • Scripting language, e.g. Python
  • Statistical computing package, e.g. R
  • Databases: SQL and NoSQL
  • Relational algebra
  • Parallel databases and parallel query processing
  • MapReduce concepts
  • Hadoop and Hive/Pig
  • Custom reducers
  • Experience with XaaS like AWS

COMMUNICATION & VISUALIZATION
  • Able to engage with senior management
  • Storytelling skills
  • Translate data-driven insights into decisions and actions
  • Visual art design
  • R packages like ggplot or lattice
  • Knowledge of any visualization tools, e.g. Flare, D3.js, Tableau
