There have thousands of crimes are happening daily all around. But people keep statistics only few of them, therefore crime rates are increasing day by day. The reason behind can be less concern or less statistics of previous crimes. It is much more important to observe the previous crime statistics for general people to make their outing decision and police for catching the criminals are taking steps to restrain the crimes and tourists to make their travelling decision. National institute of justice releases crime survey data for the country, but does not offer crime statistics up to Union or Thana level. Considering all of these cases we have come up with an approach which can give an approximation to people about the safety of a specific location with crime ranking of different areas locating the crimes on a map including a future crime occurrence prediction mechanism. Our approach relies on different online Bangla newspapers for crawling the crime data, stemming and keyword extraction, location finding algorithm, cosine similarity, naive Bayes classifier, and a custom crime prediction model
翻译:每天全球各地都会发生数千起犯罪事件。但人们仅对其中少数进行统计,因此犯罪率日益上升。其背后的原因可能是对先前犯罪事件的关注不足或统计不充分。对于普通民众来说,观察以往的犯罪统计数据对于做出行决策至关重要;对于警方而言,这些数据有助于抓捕罪犯并采取措施遏制犯罪;对于游客来说,则有助于做出旅行决策。美国国家司法研究所会发布全国犯罪调查数据,但并未提供至联合区或警区级别的犯罪统计。考虑到这些情况,我们提出了一种方法,该方法能够通过地图上不同区域的犯罪定位、犯罪排名,以及未来犯罪事件预测机制,向人们提供特定地点安全性的近似评估。我们的方法依赖于不同的孟加拉语在线报纸来爬取犯罪数据,并涉及词干提取与关键词提取、位置查找算法、余弦相似度、朴素贝叶斯分类器以及一个自定义的犯罪预测模型。