Mining Biodiversity (MiBIO): innovative computational techniques to mine BHL texts
On January 15, 2014, ten international research funders from four countries jointly announced the winners of the third Digging into Data Challenge, a competition to develop new insights, tools and skills in innovative humanities and social science research using large-scale data analysis.
Fourteen international teams representing Canada, the Netherlands, the United Kingdom, and the United States will receive grants totaling approximately $5.1 million to investigate how computational techniques can be applied to “Big Data”. Their work will address the evolving nature of humanities and social sciences research, which often relies on massive multisource datasets. Each team represents collaborations among scholars, scientists, and information professionals from leading universities and libraries in Europe and North America.
The first round of the Digging into Data Challenge was held in 2009 and the second in 2011. Previous Digging into Data research projects have received international attention. The 2013 winners were selected from among 69 applications and address a wide variety of topics. Among the sponsoring bodies, the Institute of Museum and Library Services (IMLS) announced, on January 15, 2014, the winners of the third Digging into Data Challenge that they will fund. IMLS’s contribution of $424,591 supports the American researchers from three of the mentioned fourteen international teams: the Missouri Botanical Garden, the University of Wisconsin-Madison, and the University of South Carolina.
As part of an international team involving partners in UK, Canada and the USA, Missouri Botanical Garden was awarded $175,000 by the IMLS to lead the USA members of the Mining Biodiversity Project team in applying innovative computational techniques to biodiversity texts and help improve the way we view, access and share the knowledge contained within the Biodiversity Heritage Library.
The Mining Biodiversity (MiBIO) Project
Principal Investigators:
- (United Kingdom) Sophia Ananiadou, Director of the National Centre for Text Miningand Professor in the School of Computer Science at the University of Manchester;
- (Canada) Anatoliy Gruzd, Director of the SocialMedia Laband Associate Professor at Dalhousie University;
- (USA) William Ulate Rodríguez, Global Coordinator & Technical Director for US/UK of the Biodiversity Heritage Libraryand Senior Project Manager of the Center for Biodiversity Informatics at the Missouri Botanical Garden.
The MiBIO project is an international collaboration (UK, Canada, USA), leveraging large-scale data resources collected and curated by BHL members in US and worldwide, a digital portal developed and maintained by the BHL Technical staff at Missouri Botanical Garden and managed by the BHL Secretariat at the Smithsonian Institution Libraries, text mining resources developed by the UK partner (NaCTeM), and social media, visualization and text cleaning methodologies developed by the Canadian partner (Dalhousie).
The project’s prime objective is to promote the efficient access to and sharing of large-scale biodiversity legacy literature by the worldwide community of science libraries, museums and the interested public The project will integrate novel text mining methods, visualization, crowdsourcing and social media into the BHL to provide a semantic search system aimed to transform BHL into a next-generation social digital library resource that facilitates the study and discussion (via social media integration) of legacy science documents on biodiversity by a worldwide community.
The project’s prime objective is to promote the efficient access to and sharing of large-scale biodiversity legacy literature by the worldwide community of science libraries, museums and the interested public The project will integrate novel text mining methods, visualization, crowdsourcing and social media into the BHL to provide a semantic search system aimed to transform BHL into a next-generation social digital library resource that facilitates the study and discussion (via social media integration) of legacy science documents on biodiversity by a worldwide community.
By promoting the development of capabilities that will foster collaboration among researchers from the fields of History of Science, Environmental History, Environmental Studies, Library and Information Science and Social Media, the project hopes to make a significant impact on the above disciplines by (1) enriching a large-scale library, i.e., the BHL, via innovative application of Text Mining techniques to produce semantic metadata and a term inventory, (2) providing improved access to biodiversity-related digital artifacts via an enhanced search engine and visualization of results and (3) stimulating increased collaboration, interaction and sharing of information among BHL users via the social media environment.
The Institute of Museum and Library Services (IMLS) is the primary source of federal support for the nation’s 123,000 libraries and 17,500 museums. Our mission is to inspire libraries and museums to advance innovation, lifelong learning, and cultural and civic engagement. Our grant making, policy development, and research help libraries and museums deliver valuable services that make it possible for communities and individuals to thrive. To learn more, visit www.imls.gov and follow us on Facebook and Twitter.
In addition to the U.S. Institute of Museum and Library Services, the sponsoring funding bodies include
- Arts & Humanities Research Council (United Kingdom),
- Economic & Social Research Council (United Kingdom),
- Canada Foundation for Innovation (Canada),
- National Endowment for the Humanities (United States),
- Natural Sciences and Engineering Research Council (Canada),
- National Science Foundation (United States),
- Netherlands Organisation for Scientific Research in collaboration with The Netherlands eScience Center (NLeSC) (Netherlands),
- Social Sciences and Humanities Research Council (Canada).
The U.K. charity Jisc will be providing professional program management in the progression of the United Kingdom projects.
The Missouri Botanical Garden’s mission is “to discover and share knowledge about plants and their environment in order to preserve and enrich life.” Today, 153 years after opening, the Missouri Botanical Garden is a National Historic Landmark and a center for science, conservation, education and horticultural display.
The Center for Biodiversity Informatics (CBI) at the Missouri Botanical Garden seeks to provide innovative technology solutions to the global community of life science scholars in order to mobilize, integrate, and repatriate data about the world’s biodiversity.
The Center for Biodiversity Informatics (CBI) at the Missouri Botanical Garden seeks to provide innovative technology solutions to the global community of life science scholars in order to mobilize, integrate, and repatriate data about the world’s biodiversity.
Official Winners Announcement:http://www.diggingintodata.org/Home/AwardRecipientsRound32013/tabid/201/Default.aspx
IMLS Press Release:http://www.imls.gov/5.1_million_awarded_for_delving_into_big_humanities_and_social_science_data.aspx
Dalhousie University Press Release: http://www.dal.ca/news/2014/01/22/natural-history-for-the-digital-age.html
William Ulate R.
BHL Global Coordinator and BHL-US/UK Technical Director
Missouri Botanical Garden