scalable text; topic modelling; fractal geometry; informational granularity; digital hermeneutics
Résumé :
[en] This chapter proposes a method of text analysis that combines conceptual aspects from the model of scalable or zoomable text (z-text), topic modelling and fractal geometry. It argues that this type of methodology may assist in detecting different levels of generality and specificity in texts and reveal some characteristics of the assemblage of blocks of text, above the word level, at different scales of representation. Applications of such an approach can range from hermeneutics and discourse analysis to text (and possibly z-text) generation and summarization.
Centre de recherche :
Luxembourg Centre for Contemporary and Digital History (C2DH) > Digital History & Historiography (DHI)
Disciplines :
Arts & sciences humaines: Multidisciplinaire, généralités & autres
Auteur, co-auteur :
ARMASELU, Florentina ; University of Luxembourg > Luxembourg Centre for Contemporary and Digital History (C2DH) > Digital History and Historiography
Co-auteurs externes :
no
Langue du document :
Anglais
Titre :
Text, Fractal Dust and Informational Granularity: A Study of Scale
Date de publication/diffusion :
18 décembre 2023
Titre de l'ouvrage principal :
Zoomland: Exploring Scale in Digital History and Humanities
Editeur scientifique :
ARMASELU, Florentina ; University of Luxembourg > Luxembourg Centre for Contemporary and Digital History (C2DH) > Digital History and Historiography
FICKERS, Andreas ; University of Luxembourg > Luxembourg Centre for Contemporary and Digital History (C2DH) > Digital History and Historiography
Allain, C., and M. Cloitre. "Characterizing the Lacunarity of Random and Deterministic Fractal Sets". Physical Review A 44.6 (1991): 3552-3558.
Armaselu (Vasilescu), Florentina. "Le livre sous la loupe. Nouvelles formes d'écriture électronique" (PhD thesis, Université de Montréal, 2010). Accessed July 1, 2023. https://papyrus.bib.umontreal. ca/xmlui/handle/1866/3964.
Armaselu, Florentina, and Charles Van den Heuvel. "Metaphors in Digital Hermeneutics: Zooming through Literary, Didactic and Historical Representations of Imaginary and Existing Cities". Digital Humanities Quarterly (DHQ) 11.3 (2017). Accessed July 1, 2023. http://www.digitalhuman ities.org/dhq/vol/11/3/000337/000337.html.
Barthes, Roland. S/Z. New York: Hill and Wang, 1974.
Barzilay, Regina, and Michael Elhadad. "Using Lexical Chains for Text Summarization". Intelligent Scalable Text Summarization (1997): 10-17.
Bjornson, Richard. "Cognitive Mapping and the Understanding of Literature". SubStance Vol. 10.1 Issue 30 (1981): 51-62.
Blei, David M. "Introduction to Probabilistic Topic Models". Communications of the ACM 55(2011). Accessed July 1, 2023. https://www.researchgate.net/publication/248701790_Introduction_to_ Probabilistic_Topic_Models.
Blei, David M., Thomas L. Griffiths, and Michael I. Jordan. "The Nested Chinese Restaurant Process and Bayesian Nonparametric Inference of Topic Hierarchies". arXiv:0710.0845, 27 August 2009. Accessed July 1, 2023. http://arxiv.org/abs/0710.0845.
Brook, Timothy. Vermeer's Hat: The Seventeenth Century and the Dawn of the Global World. London, UK: Profile Books, 2009.
Da Silva, David. "Caractérisation de La Nature Multi-échelles Des Plantes Par Des Outils de Géométrie Fractale, Influence Sur l'interception de La Lumière". (PhD thesis, Université Montpellier 2, 2008). Accessed July 1, 2023. https://www.researchgate.net/publication/256093785_Evaluation_ des_caracteristiques_geometriques_d'une_structure_vegetale_dans_le_cadre_de_l'analyse_ fractale.
Datseris, George, Inga Kottlarz, Anton P. Braun, and Ulrich Parlitz. "Estimating the Fractal Dimension: A Comparative Review and Open Source Implementations". arXiv, 13 September 2021. Accessed July 1, 2023. http://arxiv.org/abs/2109.05937.
Di Ieva, Antonio. The Fractal Geometry of the Brain. Springer Series in Computational Neuroscience. New York: Springer, 2016.
Dretske, Fred I. Knowledge and the Flow of Information, The David Hume Series, Philosophy and Cognitive Science Reissues, Leland Stanford Junior University: CSLI Publications, 1999.
Falconer, Kenneth. Fractal Geometry: Mathematical Foundations and Applications. West Sussex, United Kingdom: John Wiley & Sons, Incorporated, 2014.
Fang, Yunxia, Xiaoqin Zhang, Xian Zhang, Tao Tong, Ziling Zhang, Gengwei Wu, Linlin Hou, et al. "A High-Density Genetic Linkage Map of SLAFs and QTL Analysis of Grain Size and Weight in Barley (Hordeum Vulgare L.)". Frontiers in Plant Science 11 (2020). https://doi.org/10.3389/fpls.2020. 620922.
Gouyet, J.-F. Physics and Fractal Structures. Paris: Masson éditeur, 1996. Accessed July 1, 2023. https://vdoc.pub/documents/physics-and-fractal-structures-p7bgb9hco6c0.
Graham, Shawn, Scott Weingart, and Ian Milligan. "Getting Started with Topic Modeling and MALLET". Programming Historian, 2012. https://doi.org/10.46430/phen0017.
Harrar, K, and L Hamami. "The Box Counting Method for Evaluate the Fractal Dimension in Radiographic Images". In 6th International Conference on Circuits, Systems, Electronics, Control & Signal Processing (CSECS'07), 6. Cairo, Egypt, 2007. Accessed July 1, 2023. https://www.research gate.net/publication/254455405_The_Box_Counting_Method_for_Evaluate_the_Fractal_Dimen sion_in_Radiographic_Images.
Hart, Michael. "The Project Gutenberg Mission Statement". 2004. Accessed July 1, 2023. https://www. gutenberg.org/about/background/mission_statement.html.
Hitchcock, Tim. "Big Data, Small Data and Meaning". Historyonics, 9 November 2014. Accessed July 1, 2023. http://historyonics.blogspot.com/2014/11/big-data-small-data-and-meaning_9.html.
Ignatenko, V, S Koltcov, S Staab, and Z Boukhers. "Fractal Approach for Determining the Optimal Number of Topics in the Field of Topic Modeling". Journal of Physics: Conference Series 1163 (2019). https://doi.org/10.1088/1742-6596/1163/1/012025.
Ijasan, Kolawole, George Tweneboah, and Jones Odei Mensah. "Anti-Persistence and Long-Memory Behaviour of SAREITs". Journal of Property Investment & Finance 35.4 (2017): 356-368. https://doi. org/10.1108/JPIF-09-2016-0073.
James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. An Introduction to Statistical Learning with Applications in R. Vol. 103. Springer Texts in Statistics. New York, NY: Springer, 2017. https://doi.org/10.1007/978-1-4614-7138-7.
Johnson, Steven. "The long zoom". Seminars about Long-Term Thinking, 11 May 2007. Accessed July 1, 2023. https://longnow.org/seminars/02007/may/11/the-long-zoom/.
Kaminskiy, Roman, Nataliya Shakhovska, Jana Kajanova, and Yurii Kryvenchuk. "Method of Distinguishing Styles by Fractal and Statistical Indicators of the Text as a Sequence of the Number of Letters in Its Words". Edited by Marcin Hernes. Mathematics 9, no. 2410 (2021). https://doi.org/10.3390/math9192410.
Karperien, Audrey L., and Herbert F. Jelinek. "Box-Counting Fractal Analysis: A Primer for the Clinician". In The Fractal Geometry of the Brain edited by Antonio Di Ieva, 13-42 New York: Springer, Springer Series in Computational Neuroscience, 2016.
Klinkenberg, Brian. "A Review of Methods Used to Determine the Fractal Dimension of Linear Features". Mathematical Geology 26.1 (1994): 23-46. https://doi.org/10.1007/BF02065874.
Lafon, Pierre. "Sur la variabilité de la fréquence des formes dans un corpus". Mots 1.1 (1980): 127-165. https://doi.org/10.3406/mots.1980.1008.
Mandelbrot, Benoit B. The Fractal Geometry of Nature. New York: W.H. Freeman and Company, 1983.
Marcus, Solomon, Poetica matematicâ (Mathematical Poetics), Bucharest, Editura Academiei Republicii Socialiste Romania, 1970.
McCallum, Andrew Kachites. MALLET: A Machine Learning for Language Toolkit. V. 2.0.8. University of Massachusetts Amherst. 2002. Accessed July 26, 2023. https://mallet.cs.umass.edu/download.php.
Moretti, Franco. Distant Reading. London, UK, New York, US: Verso, 2013.
Morris, Jane, and Graeme Hirst. "Lexical Cohesion Computed by Thesaural Relations as an Indicator of the Structure". Computational Linguistics 17.1 (1991): 21-48. https://doi.org/10.1016/B0-08- 044854-2/05234-2.
Mueller, Martin. "Shakespeare His Contemporaries: Collaborative Curation and Exploration of Early Modern Drama in a Digital Environment". Digital Humanities Quarterly (DHQ) 8.3 (2014). Accessed July 27, 2023. http://www.digitalhumanities.org/dhq/vol/8/3/000183/000183.html.
Najafi, Elham, and Amir H. Darooneh. "The Fractal Patterns of Words in a Text: A Method for Automatic Keyword Extraction". Edited by Francisco J. Esteban. PLOS ONE 10.6 (2015): e0130617.
Nietzsche, Friedrich. Beyond Good and Evil. The Project Gutenberg eBook, 2009, reprint of the Helen Zimmern translation from German into English of "Beyond Good and Evil," as published in The Complete Works of Friedrich Nietzsche (1909-1913). Accessed July 24, 2023. https://www.guten berg.org/ebooks/4363.
Onicescu, Octav, "Energie informationnelle", Comptes RendusAcad. Sci., Paris, 263, 1966, 22, 841-842, cited in Marcus (1970).
Ostwald, Michael J., and Josephine Vaughan. The Fractal Dimension of Architecture. Cham: Springer International Publishing, 2016. https://doi.org/10.1007/978-3-319-32426-5.
Pareyon, Gabriel. "Fractal Theory and Language: The Form of Macrolinguistics". In Form and Symmetry: Art and Science Buenos Aires Congress, 2007. Accessed July 23, 2023. https://www.mi. sanu.ac.rs/vismath/BA2007/sym79.pdf.
Pavlov, Alexey N., Werner Ebeling, Lutz Molgedey, Amir R. Ziganshin, and Vadim S. Anishchenko. "Scaling Features of Texts, Images and Time Series". Physica A: Statistical Mechanics and Its Applications 300.1-2 (2001): 310-324. https://doi.org/10.1016/S0378-4371(01)00341-7.
Peltier, Jon. "Step Chart Without Risers". Peltier Tech. Peltier Technical Services-Excel Charts and Programming (blog), 24 May 2008. Accessed July 1, 2023. https://peltiertech.com/line-chart- without-risers/.
Peng, C.-K., S. V. Buldyrev, A. L. Goldberger, S. Havlin, F. Sciortino, M. Simons, and H.E. Stanley. "Long-Range Correlations in Nucleotide Sequences". Nature 356.6365 (1992): 168-170. https://doi.org/10.1038/356168a0.
Perez-Mercader, Juan. "Scaling Phenomena and the Emergence of Complexity in Astrobiology". In Astrobiology, edited by Gerda Horneck and Christa Baumstark-Khan, 337-360. Berlin, Heidelberg: Springer Berlin Heidelberg, 2002. https://doi.org/10.1007/978-3-642-59381-9_22.
Plotnick, Roy E., Robert H. Gardner, and Robert V. O'Neill. "Lacunarity Indices as Measures of Landscape Texture". Landscape Ecology 8.3 (1993): 201-211. https://doi.org/10.1007/BF00125351.
Resnik, Philip. "Using Information Content to Evaluate Semantic Similarity in a Taxonomy". arXiv: Cmp-Lg/9511007 November, 1995. Accessed July 1, 2023. http://arxiv.org/abs/cmp-lg/9511007.
Rothschild, Emma. The Inner Life of Empires: An Eighteenth-Century History. Princeton, US, Oxford, UK: Princeton University Press, 2011.
Ryan, Marie-Laure. "Cognitive Maps and the Construction of Narrative Space". In Narrative Theory and the Cognitive Sciences, edited by David Herman, 214-242. Stanford, California: CSLI Publications, 2003.
Saha, Kunal, Vinodh Madhavan, and Chandrashekhar G.R. "Pitfalls in Long Memory Research". In Cogent Economics & Finance, edited by David McMillan 8.1 (2020): 1733280.
Sapoval, Bernard. Les Fractales. Fractals. Paris: Aditech, 1989.
Shannon, Claude E. "A Mathematical Theory of Communication". The Bell System Technical Journal 27 (1948): 379-423, 623-656.
Shannon, Claude E. "Prediction and Entropy of Printed English". The Bell System Technical Journal, January 1951, 12.
Slawomirski, Mariusz R. "Fractal Structures and Self-Similar Forms in the Artwork of Salvador Dali". Prace Instytutu Mechaniki Gorotworu PAN, Instytut Mechaniki Gorotworu PAN 15.3-4 (2013): 131-146.
Sparks, Randy J. The Two Princes of Calabar: An Eighteenth-Century Atlantic Odyssey. Cambridge, Massachusetts, London, England: Harvard University Press, 2004.
Stein, Sarah, Abrevaya. Plumes: Ostrich Feathers, Jews, and a Lost World of Global Commerce. New Haven and London: Yale University Press, 2008.
Swift, Jonathan. Gulliver's Travels into Several Remote Nations of the World. The Project Gutenberg eBook, 2009, first published in 1726. Accessed July 24, 2023. https://www.gutenberg.org/ ebooks/829.
Tacenko, Natalija. "Fractal Theory of Discourse Construction: Some Hypothetic Ideas". UDC 81'111, November 2016, 1-8. Accessed July 1, 2023. https://www.researchgate.net/publication/ 313403670_FRACTAL_THEORY_OF_DISCOURSE_CONSTRUCTION_SOME_HYPOTHETIC_IDEAS.
Tanaka-Ishii, Kumiko. "Long-Range Correlation Underlying Childhood Language and Generative Models". Frontiers in Psychology 9 (2018): 1725. https://doi.org/10.3389/fpsyg.2018.01725.
Tanaka-Ishii, Kumiko, andArmin Bunde. "Long-Range Memory in Literary Texts: On the Universal Clustering of the Rare Words". Edited by Tobias Preis. PLOS ONE 11.11 (2016): e0164658. https://doi.org/10.1371/journal.pone.0164658.
Tolle, Charles R., Timothy R. McJunkin, David T. Rohrbaugh, and Randall A. LaViolette. "Lacunarity Definition for Ramified Data Sets Based on Optimal Cover". Physica D: Nonlinear Phenomena 179.3-4 (2003): 129-152. https://doi.org/10.1016/S0167-2789(03)00029-0.
Trivellato, Francesca. "Is There a Future for Italian Microhistory in the Age of Global History?" California Italian Studies 2.1 (2011): 25. Accessed July 24, 2023. http://escholarship.org/uc/item/ 0z94n9hq.
Underwood, Ted. Distant Horizons. Digital Evidence and Literary Change. Chicago and London: The University of Chicago Press, 2019.
Wills, John E., Jr. 1688: A Global History. New York, London: W.W. Norton & Company, 2001.
Wu, Jiaxin, Xin Jin, Shuo Mi, and Jinbo Tang. "An Effective Method to Compute the Box-Counting Dimension Based on the Mathematical Definition and Intervals". Results in Engineering 6 (2020). https://doi.org/10.1016/j.rineng.2020.100106.
Zipf, George Kingsley. Human Behavior and the Principle of Least Effort. An Introduction to Human Ecology. Mansfield Centre, CT: Martino Publishing, 2012, first published by Addison-Wesley Press, 1949.