Hi, I'm Alistair!
I Help You Understand Data
If you’re managing a business, you have lots of day to day data. I help you understand what the data tells us about your operation in a fun, visual format.
See some of the projects I've worked on:
No Cleaning or Presentations. Only Queries & Visualizations.
Security Indident Analysis
NB: All data and insights are purely hypothetical, the data is fake, there are no mentions of any Comapny or Organization, any similarities to any Organization are purely coincidental.
This project focused on looking at Security Incidents in a logistics (FMCG) setting.
To understand our process, let’s quickly walk through it step by step:
- Sit with the client to understand the project and their expectations, get all the questions they want answered and in what format.
- Create a plan to gather all the required tools and information, creating a project dashboard, schedules and sharing all information with relevant parties.
- Gather only relevant data in different formats (CSV, Streams, XML, API, Databases, etc) – Update shared project file.
- Isolate and scan data files for security threats using relevant tools – update shared project file.
- Clean Data (this step we left out in this project, as an example only) – update shared project file.
- Store Data into local and cloud based repositories/Databases to query later – update shared project file.
- Make a Copy of data, clearly labelling each version and documenting each step – update shared project file.
- Write queries in the relevant language, ie Python, SQL or simple Excel commands. Document the queries – update shared project file.
- Create visualizations from results of queries – update shared project file.
- Compile all data insights and visualizations into the required presentation format (Slide Presentation, Video or step by step walkthrough) – update shared project file.
- Deliver presentation to client, getting feedback.
Go through data to find any other recommendations to make outside of their requests, offering more value for money.
Insights Requested From This Data
- How many incidents are there per location?
- What types of incidents are there and how many?
- Whats the ratio of severity?
- Are all incidents resolved on time?
- Can incidents be attributed to certain staff on duty?
I used ChatGPT to help me formulate some data table headers.
I then used Mockaroo to create the fake data and downloaded it in CSV format.
I scanned the downloaded .csv file with my local Spybot softwre to check for any threats, there were none.
The data was saved into my local SQlite database.
I then used ChatGPT to write code and inserted the resulting code into my Jupyter Notebook, running on Annaconda.
The resulting code can be found below, in the embedded PDF.
I also used ChatGPT to write a brief Report about the Analysis, the Report is below.
Let's Help You Understand Data
01.
Understand
You explain your business so we understand what you want to now
02.
Plan
We create a plan to collect the necessary information to do the analysis.
03.
Investigate
We investigate and uncover the sources of data that will be needed to do the analysis
04.
Collect Data
Data is collected from various sources and in multiple formats as necessary
05.
Scanned
Collected data is scanned for viruses and other threats before being moved
06.
Clean, Format
Data is cleaned and refotmatted so that it can be safely stored in a database
07.
Query
We use multiple tools from SQL to excel to query the data and uncover answers
08.
Presentation
A complete presentation is delivered answering questions
Security Incident Analysis Report
Introduction
This report provides a comprehensive overview of the analysis conducted on the Incidents database. The purpose of this analysis was to explore, understand, and summarize the data stored in the database, and to illustrate the types of queries and insights that can be derived from it.
The database contains detailed records of incidents, including the time, location, personnel involved, and outcomes. This report will describe the methodology, types of queries executed, visualizations prepared, and the insights that can be gained from such an analysis.
Database Structure
The Incidents database contains a single table named ‘Incidents’.
This table includes the following columns:
– Incident_ID (INTEGER): Unique identifier for each incident.
– Date (TEXT): Date of the incident.
– Time (TEXT): Time of the incident.
– Employee_ID (TEXT): Identifier of the employee involved.
– Incident_Type (TEXT): Classification of the incident (e.g., safety, security, operational).
– Location (TEXT): Area or department where the incident occurred.
– Severity (TEXT): Severity level of the incident (e.g., minor, major, critical).
– Status (TEXT): Current status of the incident (e.g., open, closed, under investigation).
– Actions_Taken (TEXT): Summary of corrective or preventive measures applied.
Analysis and Queries
The analysis of the Incidents database involved a variety of queries to explore the patterns, trends, and characteristics of the recorded incidents.
The following types of queries were designed and conceptually executed:
- Monthly Incident Counts: Aggregating incidents per month to visualize trends over time.
- Incidents by Type: Counting the number of incidents of each type to identify which types occur most frequently.
- Incidents by Location: Grouping incidents by location to detect areas with higher incident rates.
- Severity Analysis: Summarizing incidents by severity level to understand the distribution of incident impact.
- Status Tracking: Counting incidents based on their current status to monitor resolution efficiency.
- Employee Involvement: Counting incidents associated with each employee to identify patterns or potential issues.
- Time-based Analysis: Analyzing incidents by time of day to detect temporal trends or risk periods.
- Cross-Analysis: Combining multiple columns, such as Severity vs Location or Type vs Status, to identify correlations and actionable insights.
Visualization Strategy
To better understand the data, visualizations were conceptualized based on the queries. Examples include:
– Line charts for monthly incident trends.
– Bar charts for incident type distribution.
– Heatmaps for location vs severity to quickly spot critical areas.
– Stacked bar charts to show status distribution over time.
– Pie charts for severity proportions.
These visualizations help in communicating key findings and highlighting areas that may require immediate attention.
Insights and Implications
From the conceptual analysis and queries, several insights can be derived:
1. Peak months with higher incident counts can be identified, which may correlate with operational load, seasonal factors, or staffing changes.
2. Certain incident types may dominate, indicating areas where preventive measures could be strengthened.
3. Locations with higher incident rates can be prioritized for safety audits, training, or policy improvements.
4. Severity patterns help management allocate resources and attention to critical incidents.
5. Tracking incident resolution status ensures timely follow-ups and compliance with organizational policies.
6. Employee-related trends could highlight the need for additional training, supervision, or workflow adjustments.
7. Temporal patterns in incident occurrences allow for proactive interventions during high-risk periods.
Overall, these insights support data-driven decision-making to reduce incidents, improve safety and security, and optimize operational processes.
Conclusion
In conclusion, the conceptual analysis of the Incidents database demonstrates the wealth of insights that can be extracted from structured incident records.
By systematically querying the data and designing visualizations, management can identify patterns, allocate resources effectively, and implement strategies to mitigate risks.
This report highlights the potential of data-driven approaches to enhance operational efficiency and safety, even without directly querying the live data. The methodologies and queries described can be adapted and extended to monitor ongoing incidents and to create dynamic dashboards for real-time operational awareness.
About Me
Your Data, Your Business!
Every second, your business generates data.
To most people, the data is a pile of numbers and letters. To the trained analyst though, the data tells a story.
I help uncover the stories and present it in a fun visual format, that’s easy for anyone to understand.