HOD: A Benchmark Dataset for Harmful Object Detection
Abstract: Recent multi-media data such as images and videos have been rapidly spread out on various online services such as social network services (SNS). With the explosive growth of online media services, the number of image content that may harm users is also growing exponentially. Thus, most recent online platforms such as Facebook and Instagram have adopted content filtering systems to prevent the prevalence of harmful content and reduce the possible risk of adverse effects on users. Unfortunately, computer vision research on detecting harmful content has not yet attracted attention enough. Users of each platform still manually click the report button to recognize patterns of harmful content they dislike when exposed to harmful content. However, the problem with manual reporting is that users are already exposed to harmful content. To address these issues, our research goal in this work is to develop automatic harmful object detection systems for online services. We present a new benchmark dataset for harmful object detection. Unlike most related studies focusing on a small subset of object categories, our dataset addresses various categories. Specifically, our proposed dataset contains more than 10,000 images across 6 categories that might be harmful, consisting of not only normal cases but also hard cases that are difficult to detect. Moreover, we have conducted extensive experiments to evaluate the effectiveness of our proposed dataset. We have utilized the recently proposed state-of-the-art (SOTA) object detection architectures and demonstrated our proposed dataset can be greatly useful for the real-time harmful object detection task. The whole source codes and datasets are publicly accessible at https://github.com/poori-nuna/HOD-Benchmark-Dataset.
- Alcohol detection for car locking system. In 2018 IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE), pages 230–233. IEEE, 2018.
- Review of deep learning: Concepts, cnn architectures, challenges, applications, future directions. Journal of big Data, 8:1–74, 2021.
- The influence of media violence on youth. Psychological science in the public interest, 4(3):81–110, 2003.
- Forensic application of the luminol reaction as a presumptive test for latent blood detection. Talanta, 72(3):896–913, 2007.
- Weapon detection in real-time cctv videos using deep learning. IEEE Access, 9:34366–34382, 2021.
- Alcohol detection and vehicle controlling. International Journal of Engineering Trends and Applications (IJETA), 2(2):92–97, 2015.
- Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934, 2020.
- Short-term and long-term effects of violent media on aggression in children and adults. Archives of pediatrics & adolescent medicine, 160(4):348–352, 2006.
- Joanne Cantor. Children’s attraction to violent television programming. Why we watch: The attractions of violent entertainment, pages 88–115, 1998.
- Dangerous behaviors detection based on deep learning. In Proceedings of the 2nd International Conference on Artificial Intelligence and Pattern Recognition, pages 24–27, 2019.
- Mmdetection: Open mmlab detection toolbox and benchmark. arXiv preprint arXiv:1906.07155, 2019.
- “i luv u:)!”: A descriptive study of the media use of individuals in romantic relationships. Family Relations, 60(2):150–162, 2011.
- Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
- Centernet: Keypoint triplets for object detection. In Proceedings of the IEEE/CVF international conference on computer vision, pages 6569–6578, 2019.
- The pascal visual object classes (voc) challenge. International journal of computer vision, 88:303–338, 2010.
- Gun and knife detection based on faster r-cnn for video surveillance. In Iberian conference on pattern recognition and image analysis, pages 441–452. Springer, 2019.
- Ross Girshick. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision, pages 1440–1448, 2015.
- Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 580–587, 2014.
- Real-time gun detection in cctv: An open problem. Neural networks, 132:297–308, 2020.
- Age at onset of alcohol use and its association with dsm-iv alcohol abuse and dependence: results from the national longitudinal alcohol epidemiologic survey. Journal of substance abuse, 9:103–110, 1997.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
- Speed/accuracy trade-offs for modern convolutional object detectors. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7310–7311, 2017.
- Longitudinal relations between children’s exposure to tv violence and their aggressive and violent behavior in young adulthood: 1977-1992. Developmental psychology, 39(2):201, 2003.
- Real-time abnormal object detection for video surveillance in smart cities. Sensors, 22(10):3862, 2022.
- Alcohol detection system. IJRCCT, 3(1):059–064, 2014.
- Image analytics to detect cigarette in an image using deep learning. In Advances in Signal and Data Processing: Select Proceedings of ICSDP 2019, pages 659–678. Springer, 2021.
- An analysis of feature extraction and classification algorithms for dangerous object detection. In 2017 2nd International Conference on Electrical & Electronic Engineering (ICEEE), pages 1–4. IEEE, 2017.
- Microsoft coco: Common objects in context. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, pages 740–755. Springer, 2014.
- Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3431–3440, 2015.
- Yolo-cigarette: An effective yolo network for outdoor smoking real-time object detection. In 2021 Ninth International Conference on Advanced Cloud and Big Data (CBD), pages 121–126, 2022.
- Real-time sign language detection using human pose estimation. In Computer Vision–ECCV 2020 Workshops: Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16, pages 237–248. Springer, 2020.
- Weapon detection using yolo v3 for smart surveillance system. Mathematical Problems in Engineering, 2021:1–9, 2021.
- Automatic handgun detection alarm in videos using deep learning. Neurocomputing, 275:66–72, 2018.
- World Health Organization. WHO report on the global tobacco epidemic, 2008: the MPOWER package. World Health Organization, 2008.
- Youth exposure to violence involving a gun: evidence for adverse childhood experience classification. Journal of behavioral medicine, 42:646–657, 2019.
- You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 779–788, 2016.
- Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767, 2018.
- Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28, 2015.
- A new data glove approach for malaysian sign language detection. Procedia Computer Science, 76:60–67, 2015.
- Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
- Harmful effects of exposure to media violence: Learning of aggression, emotional desensitization, and fear. In Human aggression, pages 167–202. Elsevier, 1998.
- Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, pages 6105–6114. PMLR, 2019.
- Efficientdet: Scalable and efficient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10781–10790, 2020.
- Detection of subtype blood cells using deep learning. Cognitive Systems Research, 52:1036–1044, 2018.
- Fighting against terrorism: A real-time cctv autonomous weapons detection based on improved yolo v4. Digital Signal Processing, 132:103790, 2022.
- Deep high-resolution representation learning for visual recognition. IEEE transactions on pattern analysis and machine intelligence, 43(10):3349–3364, 2020.
- Lite-hrnet: A lightweight high-resolution network. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10440–10450, 2021.
- Deformable detr: Deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159, 2020.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.