BirdCollect: A Comprehensive Benchmark for Analyzing Dense Bird Flock Attributes

1IIT Jodhpur, India 2IIIT Delhi, India 3University of South Florida, Tampa, Florida, USA 4Crane Conservationist, Khichan, India
AAAI 2024 (AISI)

Abstract

Automatic recognition of bird behavior from long-term, uncontrolled outdoor imagery can contribute to conservation efforts by enabling large-scale monitoring of bird populations. Current techniques in AI-based wildlife monitoring have focused on short-term tracking and monitoring birds individually rather than in species-rich flocks. We present BirdCollect, a comprehensive benchmark dataset for monitoring dense bird flock attributes.It includes a unique collection of more than 6,000 high-resolution images of Demoiselle Cranes (Anthropoides virgo) feeding and nesting in the vicinity of Khichan region of Rajasthan. Particularly, each image contains an average of 190 individual birds, illustrating the complex dynamics of densely populated bird flocks on a scale that has not previously been studied. In addition, a total of 433 distinct pictures captured at Keoladeo National Park, Bharatpur provide a comprehensive representation of 34 distinct bird species belonging to various taxonomic groups. These images offer details into the diversity and the behaviour of birds in vital natural ecosystems along the migratory flyways. Additionally, we provide a set of 2,500 point-annotated samples which serve as ground truth for benchmarking various computer vision tasks like crowd counting, density estimation, segmentation, and species classification.The benchmark performance for these tasks highlight the need for tailored approaches for specific wildlife applications, which include varied conditions including views, illumination, and resolutions. With around 46.2 GBs in size encompassing data collected from two distinct nesting ground sets, it is the largest birds dataset containing detailed annotations, showcasing a substantial leap in bird research possibilities.

Visual description outlining the potential applications and challenges associated with our proposed dataset BirdCollect

Video Samples

BirdCrowd captures diverse bird actions like feeding, grooming, and social interactions in high-resolution videos. Cropped to short length due to size constrants, these samples offer valuable data for computer vision and animal behavior research.


Segmentation Samples

Lorem ipsum dolor, sit amet consectetur adipisicing elit. Modi autem quisquam earum magni saepe ex, neque necessitatibus id minima ipsa dignissimos voluptate, explicabo tempora ut doloribus? Ducimus eum iusto a? Lorem ipsum dolor sit amet consectetur adipisicing elit. Reprehenderit ipsum fuga libero! Amet voluptas eveniet ipsam blanditiis harum quaerat magni pariatur placeat aut! Sequi quaerat sed ipsam reprehenderit ratione distinctio.

Crowd Counting Samples

Lorem ipsum dolor, sit amet consectetur adipisicing elit. Modi autem quisquam earum magni saepe ex, neque necessitatibus id minima ipsa dignissimos voluptate, explicabo tempora ut doloribus? Ducimus eum iusto a? Lorem ipsum dolor sit amet consectetur adipisicing elit. Reprehenderit ipsum fuga libero! Amet voluptas eveniet ipsam blanditiis harum quaerat magni pariatur placeat aut! Sequi quaerat sed ipsam reprehenderit ratione distinctio.

Species Identification

Lorem ipsum dolor, sit amet consectetur adipisicing elit. Modi autem quisquam earum magni saepe ex, neque necessitatibus id minima ipsa dignissimos voluptate, explicabo tempora ut doloribus? Ducimus eum iusto a? Lorem ipsum dolor sit amet consectetur adipisicing elit. Reprehenderit ipsum fuga libero! Amet voluptas eveniet ipsam blanditiis harum quaerat magni pariatur placeat aut! Sequi quaerat sed ipsam reprehenderit ratione distinctio.

Dataset Details

We present the BirdCollect dataset with the objective to prepare an annotated benchmark dataset for promoting design and development of algorithms for long term ethograming of birds. This is one of the largest datasets with detailed annotations available in the research community for bird monitoring and analysis. The proposed dataset aims to address the following key research questions

(a) Details of the number of images of birds collected from two distinct sites, hereafter referred to as Site-1, Khichan village (K, S1) and Site-2, Bharatpur (B, S1). (b) Details of annotated samples corresponding to different resolution and camera sensors.

Click images to see more examples


Conclusion

The development of automated, non-invasive technologies for monitoring large bird flocks, using computer vision for crowd counting and density estimation, plays a crucial role in bird behavior analysis. These techniques allow for detailed studies of migration patterns and spatial distribution, offering vital ecological insights into factors like climate change and ecosystem health. Semantic segmentation of flocks enhances understanding of behavior dynamics, aiding in effective ecological monitoring and conservation efforts. This work supports key UN Sustainable Development Goals (SDGs), particularly those related to climate action and life on land, contributing to the 2030 Agenda for Sustainable Development by promoting wildlife conservation and community well-being.

Acknowledgements

TWe would like to express our sincere gratitude to the National Science Foundation (NSF) and the Technology Innovation Hub on Computer Vision, Augmented Reality and Virtual Reality (iHub Drishti) at the Indian Institute of Technology Jodhpur (IIT Jodhpur) for their generous support of this research.

BibTeX

      
        @inproceedings{birdcollect,
        title={BirdCollect: A Comprehensive Benchmark for Analyzing Dense Bird Flock Attributes},
        author={Kshitiz, Sonu Shreshtha, Bikash Dutta, Muskan Dosi, Mayank Vatsa, Richa Singh, Saket Anand, Sudeep Sarkar, Sevaram Mali Parihar},
        journal={AAAI 2024},
        year={2021}
        }