Presentations
The Avian Knowledge Network: A Hemisphere-wide Partnership to Organize, Analyze, and Visualize Bird Observation Data for Education, Conservation, Research, and Land Management--official program
Session Chairs: Steve Kelling (stk2@cornell.edu), Grant Ballard (gballard@prbo.org), Leo Salas (lsalas@prbo.org)
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
In February of 2007, the US NABCI Monitoring Subcommittee published a report calling for improvements in the effectiveness, scope, utility, and efficiency of bird monitoring. Presentations in this session will address three of the report's main goals: integrate monitoring into bird management and conservation; coordinate monitoring programs among organizations and across spatial scales; and improve statistical design. Download a PDF of the program describing the 14 February 2008 session.
What is the Avian Knowledge Network (AKN)?
Authors: Marshall Iliff and Steve Kelling
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which introduces the AKN by discussing the overarching vision of the AKN and its data management strategies.
Design and roles
of an AKN node
Authors: Leo Salas, Grant Ballard, Brian L. Sullivan, Geoffrey Geupel, C. John Ralph, and Mark P. Herzog.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses the access nodes and how the archiving, data organization, and data exploration at the local level is becoming an integral part of the AKN as a whole.
Data
Organization in the AKN
Authors: Denis Lepage
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses the fundamental data architecture of the AKN and how myriad types of data can be organized under a single structure and how this organization is the first critical step that allows for analysis across datasets.
The Avian Knowledge Alliance (AKA): An Outreach Network for Education
and Management.
Authors: David Hanni, Brian L. Sullivan, Geoffrey R. Geupel, Jaime L. Stephens, Grant Ballard, and John D. Alexander.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses the Avian Knowledge Alliance (AKA), a cooperative effort of NGOs that help both to promote the AKN and to provide the critical link between AKN scientists and land managers, to better direct on-the-ground conservation efforts and to steer the development of AKN visualizations and tools to make them most useful.
Exploring
Observational Data Using Visualization Tools.
Authors: Brian L. Sullivan, Steve Kelling, Christopher L. Wood, Marshall J. Iliff, Daniel Fink, Grant Ballard, Mark Herzog and Doug Moody.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses the various ways to visualize data in the AKN: from maps, graphs, and charts that simply explore the raw data to advanced predictive modeling.
Web
Mapping for Bird Conservation: Leveraging the AKN.
Authors: Douglas A. Miller; S. R. Crawford, Terry D. Rich, and Steve T. Kelling.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which displays just one way in whcih AKN data have been used to further our understanding of birds in the Americas. This elegant mapping tool summarizes raw AKN data on a map and is customizable by month, and can view those data with respect to various other map layers including Nature Serve range maps, Federal land holdings, etc.
Exploring Bird Monitoring Data to Guide Management and Research
Decisions: Predicting Relative Abundance with Decision Trees.
Authors: Daniel Fink, Wesley M. Hochachka, and Nadav Nur.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses how exploratory analysis (decision trees) was used to generate extremely accurate predictive maps of bird distribution, based upon count data from eBird with respect to land cover data stored within the AKN.
Elucidating Ecological Factors Influencing Variation in Relative
Abundance of Riparian Species in California: The Significance of
Spatial Scale.
Authors: Nadav Nur, Daniel Fink, Wesley M. Hochachka, and Mark P. Herzog.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses how the bagged decision tree analysis discussed in the previous presentation was applied to point count data from California to generate predictive maps of the distribution of certain riparian species.
Integrated Bird Monitoring and the Avian Knowledge Network: Using Multiple Data Resources to Understand Spatial Variation in Demographic Processes and Abundance.
Authors: James Saracco°, David F. DeSante, Phil M. Nott, Wesley M. Hochachka, and Steve Kelling.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses how banding data from MAPS (Monitoring Avian Productivity and Survivorship) is being used to variations in demographic processes among birds. Future research will try to integrate bird count data with MAPS data to try to further clarify spatial variation in demographics.
A New Paradigm from Constant-effort Banding and Census for a Sensitive and Comprehensive Understanding of Landbird Life History Phenomena.
Authors: C. John Ralph, Leo Salas, and Steve Kelling.
Location & Date: Fourth International Partners in Flight (PIF) Conference in McAllen, Texas 13-16 February 2008.
Download this Powerpoint presentation, which discusses how banding data and observation data have been analyzed in tandem to clarify our understanding of the Swainson's Thrush migration through northern California and southern Oregon.
Using data mining techniques in modeling of bird distribution data
Authors: W. M. Hochachka, Rich Caruana, Daniel Fink, Art Munson, Mirek Riedewald, Daria Sorokina, and Steve Kelling
Location & Date: 17th International Meeting of the European Bird Census Council. Chiavenna, Italy (April 2007)
Description: This talk was a summary of the material covered in the paper (to appear in the August 2007 issue of the Journal of Wildlife Management): an introduction to the philosophy and methods of data mining for ecologists. Examples were used to briefly introduce the audience to three strengths of data mining analyses: (1) production of accurate predictions, (2) identification of important predictor variables, and (3) identification of functional forms of relationships between predictors and the response variable.
keywords/keyphrases: data mining, exploratory analysis, bird distribution
Detecting Statistical Interactions with Groves of Trees
Authors: Daria Sorokina, Rich Caruana, Mirek Riedewald, Daniel Fink
Location & Date: The Second North East Student Colloquium on Artificial Intelligence (NESCAI'07), Cornell University, Ithaca, NY, April 2007.
Description: We propose a new approach for the problem of interaction detection based on comparing performance of different regression models. Our method is based on a new machine learning algorithm, a Grove of trees, which combines additive models with regression trees in a way that allows variable interactions to be carefully controlled. By comparing the performance of restricted and unrestricted groves of trees, the existence and degree of variable interactions in the response function can be reliably detected and estimated.
Semiparametric Analysis of Large-Scale Observational Data Using Hierarchical Predictive Models
Authors: Daniel Fink and Wesley M. Hochachka
Location & Date: The European Union
for Bird Ringing (EURING) Technical Meetings, Dunedin, New Zealand
(January 2007)
Description: Hierarchical models have emerged as the preferred tool for analyzing large sets of observational data, because (1) complicated, multifaceted processes can be factored into a series of simpler, conditionally independent sub-processes, and (2) a wide variety of parametric models can be incorporated and their validity explicitly tested. However, there are many problems where there is insufficient a priori knowledge to justifiably specify parametric models at all stages of the hierarchy, even though an accurate (predictive) model is desired or even needed. For example, management of threatened or endangered species may require predictions of species' habitat preferences or responses to habitat alteration, even though insufficient information is available to construct a parametric model that is known to be a good abstraction of reality. Ecological problems characterized by having more predictor information than prior knowledge are only likely to increase as large data sets become easier to obtain. For example, the Avian Knowledge Network (AKN) currently contains millions of bird monitoring records linked to nearly 1000 landscape predictors. As the number of predictors grows it becomes more difficult to determine what predictors most affect the distribution and abundance of bird populations and the specific character of their effects for parametric modeling.
We propose a new semiparametric regression technique which we call the hierarchical predictive model (HPM) to produce highly accurate predictions even when a fully parametric model cannot be specified with confidence. HPMs specify the parametric hierarchy, where justified, while relying on the complementary strengths of powerful nonparametric data mining methods to automatically discover and fit important structure elsewhere in the hierarchy. The practical appeal of this approach is that it allows one to include as much parametric structure as is justified by subject-area knowledge. At the same time, this semiparametric regression model employs nonparametric techniques to automatically account for additional predictors and processes that are less well understood. This makes HPMs well suited for the exploratory analysis of large observational data sets, especially data sets containing large numbers of potentially informative covariates.
Exploring the ecological consistency of bird conservation regions across a gradient of human density
Authors: W. M. Hochachka, D. Fink, D. N. Bonter, R. A. Caruana, S. T. Kelling, A. Munson, M. Riedewald, D. Sorokina
Location & Date: 4th North American Ornithological Congress. Veracruz, Mexico. (October 2006)
Description: The impact of humans of their environment varies, and one axis of this variation is along a gradient of human population density. In our talk we exampled whether the impacts of varying human density could be extrapolated from one ecological region within North America to another, using the NABCI Bird Conservation Regions (BCRs) to define ecological regions. We found that, even qualitatively, the relationships between human density and bird abundance would vary among BCRs for some of the 17 species that we examined in our analyses of data from the eastern U.S. And adjacent Canada.
keywords/keyphrases: rural – urban gradient, Bird Conservation Region (BCR), prevalence, urbanization
Data mining to explore spatial and temporal variation in bird distribution: irruptive winter migrants
Authors: D. Fink, W.M. Hochachka, R. Caruana, S. Kelling, A. Munson, M. Riedewald, D. Sorokin
Location & Date: 4th North American Ornithological Congress. Veracruz, Mexico. (October 2006)
Description:The ability to describe and visualize spatial and temporal variation in bird distribution, without presupposing specific underlying patterns, is important during exploration of bird monitoring data. We demonstrate how data mining techniques can be used to model spatial and temporal variation from landscape level monitoring data while controlling for variation in detectability based on covariate information. We use these techniques to explore the irruptive winter migrations of several species in the Eastern United States. The analyses are based on data from the citizen-science based winter monitoring program, Project FeederWatch. Variation in detection rates is modeled as a function of effort spent watching birds as well as effort spent attracting birds to back yard feeders. We describe how migration patterns vary over regions and within the winter season. Statistical methods are then used to test for associations between winter migration patterns and large-scale habitat characteristics.
Hierarchical Predictive Models
Author:Daniel Fink
Location & Date: Interface 2006, 38th Symposium on the Interface of Statistics, Computing Science, and Applications. Massive Data Sets and Streams.(Pasadena, California, May 2006)
Description:Semiparametric regression models incorporate flexible nonparametric components within a parametric hierarchical modeling framework. The practical appeal of this approach is that it allows one to include as much parametric structure as is justified by subject-area knowledge. At the same time, the semiparametric regression model employs nonparametric techniques to account for additional predictors and processes that are less well understood. Most semiparametric regression techniques cannot model more than a handful of predictors nonparametrically. Methodology for including a general class of nonparametric predictive models within the hierarchical framework is presented for the regression and binary classification problems. Utilizing data-mining techniques for the predictive model we show that many more predictors can be handled nonparametrically. We also show that this method can be viewed as a general approach for extending data-mining techniques to deal with dependent data. Simulation studies are used to evaluate the hierarchical predictive models. The method is also used to predict patterns of variation in North American bird populations from a large spatial data set. The information from these models provides useful information for conservation and land management.
The value of predicting variation in distribution and abundance of birds using data-mining and machine learning techniques.
Authors:
D. FINK, W . HOCHACHKA, M. RIEDEWALD and S. KELLING
Location & Date: American Ornithological Union, (Santa Barbara, California, 2005)
Description:
Predicting variation in the distribution and abundance of birds across
a landscape is important from many perspectives, including use of this
information in conservation and management.
Traditionally, ornithologists have used parametric statistical
techniques to identify environmental predictors of birds’
distributions. However, suites of potentially suitable techniques for
the same purpose are actively under development in the fields of
data-mining and machine learning, and to date these techniques are
largely unknown to ornithologists. In this talk we will introduce
several of these methods (including support vector machines and
decision trees), explain the relative strengths of these methods in
comparison to the more familiar parametric classification techniques,
and assess the relative predictive power of multiple techniques in
describing bird distributions from simulated data in which the
true distributions are known. Our results indicate that these new
techniques, when coupled with data on both presence and absence of
birds, are a tool that should be more widely used by
ornithologists.