Data

Best Practices

Interested in learning about data with our AI Coaching?

Our AI Coach will help you overcome challenges and celebrate your progress along the way

Start learning for free

Skills

Add skills based on what you want to learn.

Popular Resources

Most Recent

How we curate

Defining Data Intuition
Ryan Harter, Principal Data Scientist, Mozilla
Ryan proposes the following definition for data intuition: a resilience to misleading data and analyses.
Data Intuition
Quick Guide: Calculate Cohort Retention Analysis with SQL
Huy Nguyen, CTO and Cofounder, Holistics Data
Huy provides a step-by-step walk-through of how to generate cohort retention by month using two source tables - users and activities.
Cohort Analysis
Practical SQL for Data Analysis
Haki Benita, Development Team Lead, Pcentra
Haki provides an in-depth guide to using SQL for fast and efficient data analysis. He dives into specific tactics including descriptive statistics, subtotals, pivot tables, running and cumulative aggregation, linear regressions, interpolations, and binning.
SQL
Analytical Excellence Is All about Speed
Cassie Kozyrkov, Chief Decision Scientist, Google
Cassie provides a nuanced take on the value of speed in data analytics. This is a multi-article series that covers: • Software skills • Handling lots of data with ease • Immunity to data science bias • Understanding the analyst's career path • Refusing to be a data charlatan • Resistance to confirmation bias • Realistic expectations of data • Knowing how to add value • Thinking differently about time
Analytics
Building high-performing Research and Data Science teams with clear career paths
Karen Church, VP Research & Data Science, Intercom
Karen describes the process and results of building a career ladder for the research, analytics, and data science team at Intercom. The IC track covers 6 levels and 6 skill groups, and the manager track covers 2 levels with 5 skill groups.
Data Science Career Ladders
Data Science Career Path & Progression
Julien Kervizic, Senior Enterprise Data Architect, GrandVision
Julien explains the 4 skill groups (he calls them data axes) of data career paths: • Data axis • Engineering axis • Business axis • Product axis
Data Science Career Ladders
Roadmap to Learn SQL
Arif Alam, Founder, Data Science Reality
Arif provides a list of the key concepts that you'll need to learn to learn SQL, along with the order to learn them in.
SQL
So You’ve Got a Really Big Dataset. Here’s How You Clean It.
Li-Lian Ang, Community and Operations Manager, BlueDot Impact
Li-Lian provides a step-by-step guide to cleaning large datasets in Python, using the Pandas and Matplotlib libraries. She explains how to filter data, standardize missing data labels, clean dependent variables, remove duplicate entries, and check for missing values in each variable and row. She suggests auditing variables by type and providing suggestions for each type, including Boolean, datetime, numerical, categorical, and text.
Data Cleaning
Prioritizing Data Science Work
Jacqueline Nolis, Principal Data Scientist, Fanatics
As a data scientist, you are constantly deciding what tasks to prioritize. There are many requests from stakeholders but not all have the same impact or innovativeness. Jacqueline recommends prioritizing projects that are both innovative and impactful as they have the greatest potential to change the business. Projects that are not innovative but still provide useful proof can also be valuable. Jacqueline advises against getting stuck doing interesting but irrelevant work or only reporting, as these contribute less to the company. Data scientists should aim to do work that both affects the company and is innovative.
Prioritization for Data Work
Prioritising the Scientific Way
Shyam Sundar Dhanabalan, Director of Data, Analytics & Strategy, Delivery Hero
Shyam proposes a scientific framework for prioritization consisting of a first principles approach and second order thinking. The first principles approach involves breaking down problems into fundamental components to remove biases. Second order thinking considers the consequences of consequences to uncover hidden impacts and complexities. Shyam outlines four types of complexities - structural, technical, temporal and directional. Complications are distinguished from complexities. A scientific prioritization process involves documenting evidence to improve the methodology over time. Consistency is key to allow positive effects to compound, and documentation helps transfer accountability to the process rather than individuals.
Prioritization for Data Work

Skills

Analytics Execution

Data Science Execution

Data Engineering