Predictive capacity of nine algorithms and an ensemble model to determine the geographic distribution of tree species
Juan Carlos Montoya-Jiménez (1), José René Valdez-Lazalde (1) , Gregorio Ángeles-Perez (1), Héctor Manuel De Los Santos-Posadas (1), Gustavo Cruz-Cárdenas (2)
iForest - Biogeosciences and Forestry, Volume 15, Issue 5, Pages 363-371 (2022)
doi: https://doi.org/10.3832/ifor4084-015
Published: Sep 20, 2022 - Copyright © 2022 SISEF
Research Articles
Abstract
The different models that predict the distribution of species are a useful tool for the evaluation and monitoring of forest resources as they facilitate the planning of their management in a changing climate environment. Recently, a significant number of algorithms have been proposed for this purpose, making it difficult to select the most appropriate to use. The evaluation of performance and predictive stability of these models can elucidate this problem. Distribution data of 17 pine species with high economic importance for Mexico were collected and distribution models were carried out. We carried out a pre-modeling design to select the prediction variables (climatic, edaphic and topographic), after which nine algorithms and an ensemble model were contrasted against one another. The true skill statistic (TSS) and the area under the curve (AUC) were used to evaluate the predictive performance of the models, and the coefficient of variation of the predictions was used to evaluate their stability. The number of predictive variables in the final models fluctuated from 6 to 12; the mean diurnal range and the maximum temperature of warmest month were included in the models for most species. Random forests, the ensemble model, generalized additive models and MaxEnt were the ones that best described the distribution of the species (AUC >0.92 and TSS >0.72); the opposite was found in Bioclim and Domain (AUC<0.75 and <0.82; and TSS<0.5 and <0.55). Support vector machine, Mahalanobis distance, generalized linear models and boosted regression trees obtained intermediate settings. The coefficient of variation indicated that Bioclim, Domain and Support vector machine have low predictive stability (CV>0.055); on the contrary, Maxent and the ensemble model attained high predictive stability (CV<0.015). The ensemble model obtained greater performance and predictive stability in the predictions of the distribution of the 17 species of pines. The differences found in performance and predictive stability of the algorithms suggest that the ensemble model has the potential to model the distribution of tree species.
Keywords
TSS, AUC, BRT, SVM, MaxEnt, Random Forests, GAM, Ensemble Model
Authors’ Info
Authors’ address
José René Valdez-Lazalde 0000-0003-1888-6914
Gregorio Ángeles-Perez 0000-0002-9550-2825
Héctor Manuel De Los Santos-Posadas 0000-0003-4076-5043
Postgrado en Ciencias Forestales. Colegio de Postgraduados. Campus Montecillo (México)
Instituto Politécnico Nacional, CIIDIR-IPN-Michoacán, COFAA, Justo Sierra 28, 59510 Jiquilpan, Michoacán (México)
Corresponding author
Paper Info
Citation
Montoya-Jiménez JC, Valdez-Lazalde JR, Ángeles-Perez G, De Los Santos-Posadas HM, Cruz-Cárdenas G (2022). Predictive capacity of nine algorithms and an ensemble model to determine the geographic distribution of tree species. iForest 15: 363-371. - doi: 10.3832/ifor4084-015
Academic Editor
Maurizio Marchi
Paper history
Received: Feb 22, 2022
Accepted: Jul 12, 2022
First online: Sep 20, 2022
Publication Date: Oct 31, 2022
Publication Time: 2.33 months
Copyright Information
© SISEF - The Italian Society of Silviculture and Forest Ecology 2022
Open Access
This article is distributed under the terms of the Creative Commons Attribution-Non Commercial 4.0 International (https://creativecommons.org/licenses/by-nc/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Web Metrics
Breakdown by View Type
Article Usage
Total Article Views: 24746
(from publication date up to now)
Breakdown by View Type
HTML Page Views: 22160
Abstract Page Views: 1363
PDF Downloads: 984
Citation/Reference Downloads: 10
XML Downloads: 229
Web Metrics
Days since publication: 793
Overall contacts: 24746
Avg. contacts per week: 218.44
Citation Metrics
Article Citations
Article citations are based on data periodically collected from the Clarivate Web of Science web site
(last update: Feb 2023)
(No citations were found up to date. Please come back later)
Publication Metrics
by Dimensions ©
Articles citing this article
List of the papers citing this article based on CrossRef Cited-by.
References
Distribución potencial de 20 especies de pinos en México [Potential distribution of 20 pine species in Mexico]. Agrociencia 52: 1043-1057. [in Spanish]
Gscholar
Principles and methods of data cleaning - Primary species and species-occurrence data (version 1.0). Report for the Global Biodiversity Information Facility - GBIF, Copenhagen, Denmark, pp. 72.
Gscholar
Novel methods improve prediction of species’ distributions from occurrence data. Ecography 29 (2): 129-151.
CrossRef | Gscholar
Distribución potencial de Pinus cembroides, Pinus nelsonii y Pinus culminicola en el Noreste de México [Potential distribution of Pinus cembroides, Pinus nelsonii and Pinus culminicola in northeastern Mexico]. Ecosistemas y Recursos Agropecuarios 5: 3-13. [in Spanish]
CrossRef | Gscholar
SoilGrids250m: global gridded soil information based on machine learning. PLoS One 12: 1-40.
CrossRef | Gscholar
DIVA-GIS, ver. 7.5. A geographic information system for the analysis of species distribution data. Versão 7: 476-486.
Gscholar
Conjunto nacional de uso del suelo y vegetación a escala 1:250.000, Serie I [National collection of land use and vegetation scale 1:250.000, Series I]. DOEG-INEGI, México City, Mexico. [in Spanish]
Gscholar
Conjunto nacional de uso del suelo y vegetación a escala 1:250.000, Serie VI [National collection of land use and vegetation scale 1:250.000, Series VI]. Instituto Nacional de Estadística y Geografía (INEGI), México City, Mexico. [in Spanish]
Gscholar
Distribución actual y futura del bosque subalpino de Pinus hartwegii Lindl en el Eje Neovolcánico Transversal [Current and future distribution of the Pinus hartwegii Lindl subalpine forest in the Tranverse Neovolcanic Belt]. Madera y Bosques 25: 1-16. [in Spanish]
CrossRef | Gscholar
A biogeographic analysis of Australian elapid snakes. In: “Atlas of elapid snakes of Australia” (Longmore R ed). Australian Flora and Fauna Series 7, Bureau of Flora and Fauna, Canberra, Australia, pp. 4-15.
Gscholar
ntbox: an R package with graphical user interface for modelling and evaluating multidimensional ecological niches. Methods in Ecology and Evolution 11: 1199-1206.
CrossRef | Gscholar
El nicho ecológico como herramienta para predecir áreas potenciales de dos especies de pino [The ecological niche as a tool for predicting potential areas of two pine species]. Revista Mexicana de Ciencias Forestales 9: 47-68. [in Spanish]
CrossRef | Gscholar
Diferencias conceptuales entre modelación de nichos y modelación de áreas de distribución [Conceptual differences between ecological niche modeling and species distribution modeling]. Revista Mexicana de Biodiversidad 88: 437-441. [in Spanish]
CrossRef | Gscholar
SiBiFor: forest biometric system for forest management in Mexico. Revista Chapingo Serie Ciencias Forestales y Del Ambiente 23: 437-455.
CrossRef | Gscholar