Are data-mining techniques useful for selecting ecological indicators in biodiverse regions? Bridges between market basket analysis and indicator value analysis from a case study in the neotropics


Butantan affiliation
Publication type
Article
Language
English
Access rights
Restricted access
Appears in Collections:
Metrics
Abstract
Ecological monitoring research relies heavily on signals to detect ecosystem changes, making the selection of indicators a crucial methodological requirement. Over the years, individual species and species assemblages have been widely used, thereby, giving rise to reference methods that support the detection of ecological indicators. One such method, the Indicator Value Analysis (IndVal), has been adapted to identify not only species but also combinations of species, assuming collective responses to environmental factors. However, the IndVal method requires a pre-selection of species before performing the analysis, especially in the case of large datasets (e.g. high species richness), when it becomes ineffective. Species pre-selection might introduce subjectivity and a bias into the database, which can cause possible impacts on the final set of indicators. To address these issues, the authors propose the use of Market Basket Analysis (MBA) – a data mining method – which is mathematically similar to IndVal but designed to handle large amounts of data. Both methods were applied to select indicators from gradually larger datasets of Soil Surface Dwelling Arthropods from the Brazilian Amazon, using threshold-dependent indices to assess concordance between results. In general, the results obtained by applying both methods were found to be similar, with an average Jaccard's distance of 0.432 (±0.346) and an average True Skill Statistic of 0.991 (±0.012). As expected, MBA was able to select ecological indicators without species pre-selection as well as from datasets where IndVal had been unsuccessful. In such cases, and by means of objective association rules, the authors demonstrate that MBA could be used to pre-select ecological indicators, which can then be further processed and summarized with the IndVal method. In this study, the authors briefly outline the potential of MBA to complement IndVal and discuss advantages and disadvantages of using MBA for ecological indicators (pre-) selection.
Reference
Leote P, Cajaiba RL, Cabral JA, Brescovit AD, Santos M. Are data-mining techniques useful for selecting ecological indicators in biodiverse regions? bridges between market basket analysis and indicator value analysis from a case study in the neotropics. Ecol. Indic. 2020 Feb;109:10583. doi:10.1016/j.ecolind.2019.105833Get.
Link to cite this reference
https://repositorio.butantan.gov.br/handle/butantan/4123
URL
https://doi.org/10.1016/j.ecolind.2019.105833
Journal title
Issue Date
2020


Files in This Item:

Existing users please Login
1-s2.0-S1470160X19308271-main.pdf
Description:
Size: 1.01 MB
Format: Adobe PDF
Embargoed until January 1, 2999    Request a copy
Show full item record

The access to the publications deposited in this repository respects the licenses from journals and publishers.