By that, I mean beyond the theoretical ideals of Data Science textbooks. Beyond just the coding challenges of how to use R or Python to encode a question. Books that engage with real-world challenges for analysts.
So, I am delighted to share one that I have discovered that does just that. A book based on practitioner experience. One that addresses the diverse challenges to delivering effective analytics in today’s changing businesses.
The book is called “Guerrilla Analytics” and is written by Enda Ridge, who is Chief Data Scientist at Sainsbury’s. The subtitles of this book are “a practical approach to working with data” and “the savvy manager’s guide”, and it delivers on both promises.
The need for Guerrilla Analytics
The need ranges from the broad sweep & ill-defined nature of analytics to the sheer number of things that change. Outside the idealistic textbooks, in the real world, data, requirements & resources all change & too many processes are ill-defined. Add to that the constraints of limited time & toolsets, and it is no wonder that too little analytics is robust & repeatable.
To address this issue directly, Enda defines the term Guerrilla Analytics, together with a number of other terms. His definition is:
“Guerrilla Analytics is data analytics performed in a very dynamic project environment that presents the team with varied and frequent disruptions and constrains the team in terms of the resources they can bring to bear on their analytics problem.”
As a foundation for the rest of this book, he then explains the 7 principles of Guerrilla Analytics. These cover practical day-to-day decisions about storage, documentation, automation, auditable work, knowledge management & code design. This is not a book just for leaders; it’s much more a practical handbook for a whole analytics team.
Practicing across the workflow
In the next part, the reader is walked through how to apply this approach throughout their workflow. Starting with Data Extraction, Enda shows how his 7 principles can be applied to improve practice. Some of the examples get into very specific detail. But the themes and lessons learnt help avoid this becoming too technical or distracting.
The diagram of the Guerrilla Analytics workflow is a useful map of this journey and would be a handy visual aide-mémoire for those seeking to work this way.
One of the real strengths of this book is how the author has peppered the text with two elements. First, illustrations to summarise his points. Second, and most importantly, “war stories”: real-life examples of how things have gone wrong. These are so helpful in seeing the application to your work.
The stages covered in detail make a useful checklist in themselves:
- Data Extraction
- Data Receipt
- Data Load
- Consolidating Knowledge
- Work Products
Across these and the following section on Testing, Enda also shares Practice Tips (90 in total). These are another way of passing on practical leadership advice from experience of ‘hacks’ that work.
Testing Guerrilla Analytics
Testing is rightly identified as a focus needed at all stages. To bring this to life for the reader, Enda focusses Part 3 on this topic.
Here he shares principles (like ‘establishing a testing culture’ and ‘test early’) before getting into practicalities. That practical application covers 3 chapters on what testing needs to look like at different stages of the workflow.
For Data Testing, he shares 5 Cs of data quality to be tested. For testing code, he shares a more detailed step-by-step testing guide than I’ve seen elsewhere for analytics programming. For testing products, Enda returns to a useful 5 Cs of testing for these & adds the extra steps for testing statistical models.
Building the capability you need
In the final section of this book, Enda shares his guidance on how to build such a capability.
Echoing themes we have covered before on this blog, he has advice on People, Process & Technology.
The chapter on People Capability stresses the importance of Softer Skills, Data Visualisation & Knowledge Management. These are in addition to the need for a number of technical skills, but he also echoes Martin’s call for an analytical attitude in people.
The Process chapter supports arguments I have made for more focus on workflow & methodologies. The technology chapter is a useful overview of the technology elements needed for a working Data Manipulation Environment (DME). That latter term is one Enda is fond of using throughout this book & which he richly describes.
Finally, there are useful Closing Thoughts & supporting appendices to motivate you to get started. With a comprehensive index, this book works well not just for an initial quick read, but also as an ongoing reference.
Do you need Guerrilla Analytics for your team?
In fact, I think it’s such a practical guide to the skills that analysts need to master in practice that I use it with my university students. I teach a module of the MSc Data Science at the University of South Wales and this is my recommended text.
Hope it also helps you educate and develop your team. Please share your thoughts or recommendations, especially if you’ve read this book. Well done, Enda, this is a really positive contribution to the analytics community.