Data
Skills
Data Engineering
- Defining Data Intuition
Ryan proposes the following definition for data intuition: a resilience to misleading data and analyses.
- Quick Guide: Calculate Cohort Retention Analysis with SQL
Huy provides a step-by-step walk-through of how to generate cohort retention by month using two source tables - users and activities.
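To make the idea concrete, here is a minimal sketch of monthly cohort retention built from two tables named users and activities. It is not Huy's query: the column names (user_id, signup_date, activity_date), the sample rows, and the use of SQLite through Python's sqlite3 module are all assumptions made for illustration.

```python
import sqlite3

# In-memory database with hypothetical columns and sample rows.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE users (user_id INTEGER PRIMARY KEY, signup_date TEXT);
CREATE TABLE activities (user_id INTEGER, activity_date TEXT);
INSERT INTO users VALUES (1, '2023-01-05'), (2, '2023-01-20'), (3, '2023-02-10');
INSERT INTO activities VALUES
  (1, '2023-01-06'), (1, '2023-02-03'), (2, '2023-01-25'),
  (3, '2023-02-11'), (3, '2023-03-01');
""")

RETENTION_SQL = """
WITH cohort AS (
  -- Each user belongs to the cohort of their signup month.
  SELECT user_id, signup_date, strftime('%Y-%m', signup_date) AS cohort_month
  FROM users
),
active AS (
  -- One row per user per month offset in which they were active.
  SELECT DISTINCT
    c.cohort_month,
    c.user_id,
    (CAST(strftime('%Y', a.activity_date) AS INTEGER)
       - CAST(strftime('%Y', c.signup_date) AS INTEGER)) * 12
    + (CAST(strftime('%m', a.activity_date) AS INTEGER)
       - CAST(strftime('%m', c.signup_date) AS INTEGER)) AS month_number
  FROM cohort AS c
  JOIN activities AS a ON a.user_id = c.user_id
),
sizes AS (
  SELECT cohort_month, COUNT(*) AS cohort_size
  FROM cohort
  GROUP BY cohort_month
)
SELECT
  a.cohort_month,
  a.month_number,
  COUNT(*) AS active_users,
  ROUND(1.0 * COUNT(*) / MAX(s.cohort_size), 2) AS retention
FROM active AS a
JOIN sizes AS s ON s.cohort_month = a.cohort_month
GROUP BY a.cohort_month, a.month_number
ORDER BY a.cohort_month, a.month_number
"""

retention_rows = conn.execute(RETENTION_SQL).fetchall()
for row in retention_rows:
    print(row)  # (cohort_month, month_number, active_users, retention)
```

Month 0 is the signup month itself, so each later month_number reads as "share of the cohort still active N months after signing up". Date functions differ between databases, so the strftime calls would need adapting outside SQLite.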
- Practical SQL for Data Analysis
Haki provides an in-depth guide to using SQL for fast and efficient data analysis. He dives into specific tactics including descriptive statistics, subtotals, pivot tables, running and cumulative aggregation, linear regressions, interpolations, and binning.
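As a rough illustration of three of these tactics (descriptive statistics, running aggregation, and binning), the sketch below runs plain SQL through Python's sqlite3 module. The orders table, its columns, and the bin width of 20 are hypothetical, the queries are not taken from Haki's guide, and the window function assumes SQLite 3.25 or newer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE orders (order_day TEXT, amount INTEGER);
INSERT INTO orders VALUES
  ('2023-01-01', 10), ('2023-01-02', 25),
  ('2023-01-03', 40), ('2023-01-04', 55);
""")

# Descriptive statistics: count, min, max, and mean in one pass.
summary = conn.execute(
    "SELECT COUNT(*), MIN(amount), MAX(amount), AVG(amount) FROM orders"
).fetchone()

# Running total: a window function accumulates amount in date order.
running = conn.execute("""
    SELECT order_day, amount,
           SUM(amount) OVER (ORDER BY order_day) AS running_total
    FROM orders
    ORDER BY order_day
""").fetchall()

# Binning: integer division groups amounts into fixed-width buckets of 20.
bins = conn.execute("""
    SELECT (amount / 20) * 20 AS bin_start, COUNT(*) AS n
    FROM orders
    GROUP BY bin_start
    ORDER BY bin_start
""").fetchall()

print(summary)
print(running)
print(bins)
```

Keeping the aggregation in SQL means only the summarized rows leave the database, which is much of the speed argument for doing this kind of analysis there.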
- Analytical Excellence Is All about Speed
Cassie provides a nuanced take on the value of speed in data analytics. This is a multi-article series that covers:
  • Software skills
  • Handling lots of data with ease
  • Immunity to data science bias
  • Understanding the analyst's career path
  • Refusing to be a data charlatan
  • Resistance to confirmation bias
  • Realistic expectations of data
  • Knowing how to add value
  • Thinking differently about time
- Building high-performing Research and Data Science teams with clear career paths
Karen describes the process and results of building a career ladder for the research, analytics, and data science team at Intercom. The IC track covers 6 levels and 6 skill groups, and the manager track covers 2 levels with 5 skill groups.
- Data Science Career Path & Progression
Julien explains the 4 skill groups (he calls them data axes) of data career paths:
  • Data axis
  • Engineering axis
  • Business axis
  • Product axis
- Roadmap to Learn SQL
Arif lists the key concepts you need to master to learn SQL, along with the order in which to learn them.
- So You’ve Got a Really Big Dataset. Here’s How You Clean It.
Li-Lian provides a step-by-step guide to cleaning large datasets in Python, using the Pandas and Matplotlib libraries. She explains how to filter data, standardize missing data labels, clean dependent variables, remove duplicate entries, and check for missing values in each variable and row. She also suggests auditing variables by type and offers recommendations for each type, including Boolean, datetime, numerical, categorical, and text.
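A few of these steps can be sketched in Pandas as follows. This is an illustration under assumptions, not Li-Lian's code: the column names, the sample rows, and the list of labels treated as missing are all made up.

```python
import numpy as np
import pandas as pd

# Hypothetical survey extract with inconsistent missing-data labels
# and one duplicated row.
raw = pd.DataFrame({
    "respondent_id": [1, 2, 2, 3, 4],
    "age": ["34", "n/a", "n/a", "51", "-"],
    "country": ["US", "SG", "SG", "unknown", "DE"],
})

# Assumed set of labels that all mean "missing" in this dataset.
MISSING_LABELS = ["n/a", "-", "unknown", ""]

# Standardize missing-data labels to NaN.
clean = raw.replace(MISSING_LABELS, np.nan)

# Remove exact duplicate rows.
clean = clean.drop_duplicates().reset_index(drop=True)

# Convert the numeric column now that its placeholders are gone.
clean["age"] = pd.to_numeric(clean["age"])

# Check for missing values per variable and per row.
missing_per_column = clean.isna().sum()
missing_per_row = clean.isna().sum(axis=1)

print(clean)
print(missing_per_column)
print(missing_per_row.tolist())
```

Standardizing the labels first matters: until "n/a" and "-" become NaN, the duplicate and missing-value checks undercount, and the numeric conversion fails.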
- Kickstarter's Engineering & Data Science Career Ladders
Kickstarter's approach has fewer levels and uses textual descriptions rather than a matrix of skills.
- Data Science Competency Matrix
Angela's matrix covers 7 IC levels and 5 manager levels.