Today, I engaged in a discussion with my teammate regarding the analysis of the fatal police shootings dataset:
We identified the following research questions for our analysis:
- Which agency IDs are associated with the highest number of fatal shooting incidents?
- What are the predominant race, age group, and threat type characteristics within these agency IDs that report the most fatal shooting incidents?
Based on guidance from our professor, we determined that it would be better to list all agencies, ranked by the number of shooting incidents attributed to each agency.
we implemented Python scripts to load the dataset and perform preliminary descriptive analyses. Specifically, we:
- Extracted key columns, including state, threat_type, race, and agency_ids.
- Assessed the dataset by determining the number of unique values within each column.
- Calculated the frequency distribution of values within each column