Highly effective data teams
A list of seven habits
This post captures the essence of my experience building teams at various companies. The title and outline are from the book by Stephen Covey.
Effective data teams are proactive about infrastructure decisions, about customer data protection, and about the design of their organization.
Begin with the end in mind.
Effective data teams identify, firstly, the business impact of of their work.
When doing data analysis, they deliver insights not tables, action items not dashboards. The analysis of data is not complete until a conclusion is drawn from all the generated tables and graphs, and the business decision is clear. A table belongs in a DB. Graphs sometimes reveal only noise. Inference and decision making by looking at tables and graphs is having to run stat-sig tests in your head. The data team is in the best position to make sense of the tables and graphs, to tell its story, and to determine if a change is worth acting upon.
When embarking on a machine learning project, effective data teams build an understanding of the intended function of the model as part of the larger product and how it is meant to change and contribute to the customer experience. They use this understanding to determine shipping and success criteria in the context of an experiment framework.
Put first things first.
Having built confidence that the underlying data is correct, effective data teams prioritize thoughtful metric design.
Metrics heavily summarize and simplify reality — disposing of its many details in order to handle complexity. This highly simplified version of reality is used to inform decisions. Business processes and ML models are built such that the value of a metric is maximized or minimized. It’s an important responsibility.
Metric design is an opportunity to build relationships with stakeholders, review data quality, and understand the business. The success of a data team is based upon the relationships, trust, and intuition the team gains, and also the data quality control processes they implement in this part of the work.
Effective data teams are the voice of the customer. A successful business needs happy customers. Low quality features, short-term wins, and bugs the team can get away with, decay the product and consequently the brand.
Seek first to understand, then to be understood.
The data science team does not operate independently — just like every other team. The work is useful if it’s rooted in what the company can accomplish. Effective data teams understand the capabilities of their cross-functional collaborators in acting upon the results and recommendations.
Effective data teams are in sync with all cross-functional collaborators. The team culture is such that collaboration is prized over competition. Teams achieve more than individuals working independently.
Sharpen the saw.
Effective data teams hire for experience to help establish best practices, provide training, and solve complex problems.
What practices have you found useful in building effective data science teams? Let me know!