https://doi.org/10.1140/epjp/s13360-022-02707-6
Regular Article
Bayesian analysis of plant DNA size distribution via non-additive statistics
1
Departamento de Física, Universidade do Estado do Rio Grande do Norte, 59610-210, Mossoró, Brazil
2
Departamento de Física, Universidade Federal do Rio Grande do Norte, 59072-970, Natal, RN, Brazil
3
Departamento de Ciências Vegetais, Universidade Federal Rural do Semi-Árido, 59625-900, Mossoró, RN, Brazil
4
Departamento de Biofísica e Farmacologia, Universidade Federal do Rio Grande do Norte, 59072-970, Natal, RN, Brazil
Received:
28
January
2022
Accepted:
8
April
2022
Published online:
21
April
2022
We report an analysis of Cucurbitaceae DNA species through a non-additive statistics formalism. Such formalisms encompass power-law correlations. Our analysis focuses on the distribution of the length of DNA bases. We compared two models for fitting the data, single and double q-exponentials, via Bayesian analysis. Our data set was retrieved from a catalog of proteins (NCBI). The results indicate that the short-range correlations, always present in coding DNA sequences, are appropriately captured through the double q-exponential, adequately describing the cumulative length distribution of DNA bases. We also found two very distinctive q-ranges for introns and exons for all chromosomes, for both Cucurbitaceae subspecies analyzed in this work.
© The Author(s), under exclusive licence to Società Italiana di Fisica and Springer-Verlag GmbH Germany, part of Springer Nature 2022