Today, I practiced using the K-means clustering algorithm on the ACLED dataset in the U.S. I discovered that the locations of events are not randomly distributed; instead, they tend to cluster in certain regions. This pattern forms distinct clusters. By applying the K-means algorithm, I was able to divide the data into two clusters.
Additionally, I’ve learned different methods to determine whether data is randomly distributed, such as using the uniform and Poisson distributions. Moving forward, I plan to use this dataset to predict future events based on location and time.