— by Robert W. Williams
Page Contents
• Retextualizer project: Sections 1 • 2 • 3 • 4 • 5
• Digital Humanities: Online Texts & Resources: Section 6
• Websites: Blogs, Centers, & DH Projects: Section 7
• Regular Expressions (Regexes): Tutorials & Testing: Section 8
• Regex Usage: Interpreting Illegible Letters: Section 9
The General Purpose of the Retextualizer Project
Retextualizer is a browser-based application for digital humanities (DH) research that is designed to facilitate new interpretations of a text, specifically by disassembling texts into meaningful components (here sentences), and then reassembling the components into different configurations, whether in reverse order or in random arrangements.
Retextualizer rearranges the original essay by juxtaposing sentences—perhaps jarringly—that were not initially so positioned; it thereby can provide the conditions for new insights into the text, its ideas, and its themes.
Each Retextualizer web page repeats the project's general purpose, as well as the instructions, which also can be read below. In addition, each project page will contain further information relevant to that specific essay.
Retextualizing the Works of W.E.B. Du Bois
"Souls of Black Folk" [SBFI](The Independent, 1904)
"The Individual and Social Conscience" [IASC](1905)
"Address to the Country" [ATTC](1906)
"The Nature of Intellectual Freedom" [IFRE](1949)
"Apologia", Suppression of the African Slave-Trade [SSTA](1954)
"Postscript", The Ordeal of Mansart [PSOM](1957)
The Project Goals of Retextualizer
Digital Humanities Research:
Texts can be read in sequence as created and/or presented, publicly or otherwise. With computers, we can digitally interact with such works. In their written forms as typescripts or manuscripts texts can be digitized and then can be (re-)analyzed and (re-)interpreted via computer software. Section 6 below lists online resources that cover various dimensions of the digital humanities. Many more resources can be located via Internet searches.
The digital manipulation of texts includes deformance, as Jerome McGann and Lisa Samuels called it in "Deformance and Interpretation" [New Literary History, 30:1 (1999): 25-56; Accessible online]. Literary works have unstable meanings, McGann and Samuels argued, and discussed several methods to use on literary texts, typically poems, including a reversal of the poem's lines.
Such techniques of deformation have received support, such as:
Cohen, Matt. 2006. "Trangenic Deformation: Literary Translation and the Digital Archive." Walt Whitman Archive [Website]. Online.
Sample, Mark. 2012. "Notes towards a Deformed Humanities." Samplereality [Blog], (Posted May 2). Online.
Criticisms of literary deformance have been put forward:
Hoover, David L. 2005. "Hot-Air Textuality: Literature after Jerome McGann." TEXT Technology, 14:2. Online (PDF).
Hoover, David L. 2007. "The End of the Irrelevant Text: Electronic Texts, Linguistics, and Literary Theory." Digital Humanities Quarterly, 1:2. Online.
Robert W. Williams's Research:
The current Retextualizer application builds on a previous version which had no copying, viewing, or sentence-numbering features. I initially coded the basic randomizing and display functions in May 1999 as a way to create and present randomized versions of essays by Immanuel Kant and Walter Benjamin.
Since that first version, digital humanities research has come to influence my scholarship, most notably by means of computer applications, such as concordancers and collation software. Those digital tools help me to understand how Du Bois paired words and phrases within their contexts, and also to illustrate how he re-used and modified text in different works over time. The Retextualizer project continues this avenue of my research.
I have created several hypertext presentations on issues related to DH. [This subsection was posted for the 1 February 2019 update.]
"The Intertextuality of Du Bois's Idea of Humanity: A Collation Analysis": The African American Studies and Research Center at Purdue University hosted the 30th Symposium on African American Culture and Philosophy on 1-3 December 2016. The symposium theme was "Exploring the 'Humanity' in the Digital Humanities".
"Algorithmic Displacement and the Black Atlantic: Retextualizing the 'Souls' Essay by W.E.B. Du Bois": I presented this at the 2018 African American Digital Humanities Conference, held at the University of Maryland, College Park, on 20 October 2018. I covered the use of the Retextualizer application on the SBFI essay.
Digital Humanities: Online Texts and Related Resources
Bailey, Moya Z. 2011. "All the Digital Humanists Are White, All the Nerds Are Men, but Some of Us Are Brave." Journal of Digital Humanities, 1:1 (Winter). Online at JDH.
Drucker, Johanna. 2012. "Humanities Theory and Digital Scholarship." In Debates in the Digital Humanities, 2012 Ed., edited by Matthew K. Gold and Lauren F. Klein. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.cuny.edu/debates/text/34
Drucker, Johanna. 2012. "Representation and the Digital Environment: Essential Challenges for Humanists." Posted at the University of Minnesota Press blog, 16 May 2012. URL: http://uminnpressblog.com/2012/05/representation-and-digital-environment.html
Gallon, Kim. 2016. "Making a Case for the Black Digital Humanities." In Debates in the Digital Humanities, 2016 Ed., edited by Matthew K. Gold and Lauren F. Klein. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.cuny.edu/debates/text/55
Gold, Matthew K. & Lauren F. Klein (Eds.). 2016. Debates in the Digital Humanities, 2016 Ed. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.cuny.edu/debates?id=2.
Kim, Dorothy & Jesse Stommel (Eds.). N.D. Disrupting the Digital Humanities. URL: www.disruptingdh.com/position-papers/
Liu, Allan. 2013. "The Meaning of the Digital Humanities." PMLA: Proceedings of the Modern Language Association, 128:2; pp.409-423. URL: http://escholarship.org/uc/item/5gc857tw.
Nowviskie, Bethany. 2016. "On the Origin of 'Hack' and 'Yack.'" In Debates in the Digital Humanities, 2016 Ed., edited by Matthew K. Gold and Lauren F. Klein. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.cuny.edu/debates/text/58
Price, Kenneth M. & Ray Siemens. 2013-2015. Literary Studies in the Digital Age: An Evolving Anthology. URL: https://dlsanthology.mla.hcommons.org/.
Risam, Roopika. 2015. "Beyond the Margins: Intersectionality and the Digital Humanities." Digital Humanities Quarterly, 9:2. Online at DHQ.
Risam, Roopika. 2015. "On Disruption, Race, and the Digital Humanities." Disrupting the Digital Humanities, Digital Edition. URL: www.disruptingdh.com/on-disruption-race-and-the-digital-humanities.
Websites: Blogs, Centers, and DH Projects
African American History, Culture & Digital Humanities (UMD)
URL: https://aadhum.umd.edu
African Diaspora PhD
URL: https://africandiasporaphd.com
Alliance of Digital Humanities Organizations (ADHO)
URL: https://adho.org
Anna Julia Cooper Center
URL: https://ajccenter.com
Black Book Interactive Project
URL: http://bbip.ku.edu
Black Past: Online Reference Guide to African American History
URL: http://blackpast.org
Black Press Research Collective
URL: http://blackpressresearchcollective.org
Black Quotidian: Everyday History in African-American Newspapers
URL: http://blackquotidian.com
Carolina Digital Humanities (UNC-Chapel Hill)
URL: https://cdh.unc.edu/
Center for Digital Research in the Humanities (UNL)
URL: https://cdrh.unl.edu
Center for South Asian and Indian Ocean Studies
URL: https://as.tufts.edu/csaios/digitalHumanities
Credo: Special Collections & University Archives, W.E.B. Du Bois Library, University of Massachusetts Amherst (Papers of Horace Mann Bond, W.E.B. Du Bois, and others)
URL: http://credo.library.umass.edu/
Colored Conventions Project: Bringing 19th Century Organizing to Digital Life
URL: www.coloredconventions.org
DHCommons.org: A Collaboration Hub
URL: http://www.dhcommons.org
Diaspora Hypertext: Black Femme History and Futures
URL: http://dh.jmjafrx.com/tag/howard-ramsby-ii/
Digital Colored American Magazine
URL: http://coloredamerican.org/
Digital Harlem: Everyday Life 1915-1930
URL: http://digitalharlem.org
Digital Humanities Association of Southern Africa
URL: http://digitalhumanities.org.za
Digital Humanities Initiative
URL: http://www.dhinitiative.org
Digital Schomburg African American Women Writers of the 19th Century
URL: http://digital.nypl.org/schomburg/writers_aa19/
East Asian Digital Humanities Lab
URL: https://guides.library.harvard.edu/EADH
Frederick Douglass in Britain and Ireland
URL: http://frederickdouglassinbritain.com
HASTAC: Humanities, Arts, Science, & Technology Alliance & Collaboratory
URL: https://www.hastac.org
History of Women Philosophers
URL: https://historyofwomenphilosophers.org
Malcolm X: A Research Site
URL: http://www.brothermalcolm.net
O Say Can You See: Early Washington, D.C., Law & Family
URL: http://earlywashingtondc.org
Recovering the U.S. Hispanic Literary Heritage Blog
URL: https://recoveryprojectappblog.wordpress.com
The Ward: Race and Class in Du Bois' Seventh Ward
URL: http://www.dubois-theward.org
WWP: Women Writers Project
URL: http://www.wwp.northeastern.edu
Regular Expressions (Regexes)
https://regular-expressions.info/near.html (Jan Goyvaerts)
https://ryanstutorials.net/regular-expressions-tutorial/
https://regexland.com/regex-tutorial/
https://blog.robertelder.org/regular-expressions
https://devopedia.org/regex-engine
Online regex testinghttps://blog.robertelder.org/regular-expression-visualizer
https://extendsclass.com/regex-tester.html
Regex Usage: Interpreting Illegible Letters
I created a multi-part tweet in January 2023 on one way to use regular expressions to determine possible characters that otherwise were indecipherable in words arising from printed or handwritten documents. I present that tweet below in its original parts, but with a few additions and reconfigurations to enhance clarity.The unpublished works of #WEBDuBois sometimes contain handwritten words that I can't read.
To find potential words that fit the sentence's meaning I use regular expression searches of a word list loaded into a text editor or concordancer.
1/11
Case Study: Handwritten Words
I screen-captured one line within what seems to be a never-finished manuscript in which Du Bois discussed his underlying philosophy.
File: Unpublished Du Bois document archived in the Credo repository (UMASS Library's Special Collections).
Title: "Steps Toward a Science of How Men Act"
Typescript: 4 pages + 3 pages of handwritten notes by Du Bois
ID: mums312-b213-i071
What are the 3rd & 4th handwritten words in the image?
Admittedly, those words in the image might be understandable within the context of the sentence fragment.
For purposes of illustrating the #regex technique, I will try to decipher those words in this tweet thread.
#Regex coded by these indicators:
=Discernible letters (based on similarities w/ letters in known words).
=Number of letters in the word (create range of letters if boundaries are indistinct).
=Contractions need to be expanded for the word list, not for the dictionary.
Third word:
=Starting "e"
="g" or other letters with descenders?
=How many letters: 7/8?
Regex
\be[a-z]{1,3}[gjpqy][a-z]{4,5}\b
Regex briefly annotated (metacharacters manage the search):
\b =word boundary
e =literal letter "e"
[a-z]{1,3} =range of 1-3 letters: "a" through "z"
[gjpqy] =match any 1 letter
[a-z]{4,5} =range of 4-5 letters: "a" through "z"
\b =word boundary
Regex results of 3rd word: 1000+ matches.
Disambiguation of the irrelevant matches is needed because regex results of 3rd word exceeded 1000 matches.
The more letters we know, or can hypothesize about, the fewer the matches.
Hypothesize "v" and "h":
\bev[a-z]{4,5}h[a-z]{1,2}\b
Regex briefly annotated:
\b =word boundary
ev =literal letters "e" and "v"
[a-z]{4,5} =range of 4-5 letters: "a" through "z"
h =literal letter "h"
[a-z]{1,2} =range of 1-2 letters: "a" through "z"
\b =word boundary
Results: 11 matches, including "everythin"
Possibly no final "g" in the original.
Unknown fourth word:
=Starting "f"
=Final letters: "ch"?
=How many letters: 5/6?
Regex
\bfe[a-z]{1,2}ch\b
Regex briefly annotated:
\b =word boundary
fe =literal letters "f" and "e"
[a-z]{1,2} =range of 1-2 letters: "a" through "z"
ch =literal letters "c" and "h"
\b =word boundary
Results: 6 matches, including "fetich"
Plausible interpretation of the fragment in context:
"Force - in everything - Fetich"
I often need to create different regexes: I can
=Change the range of possible letters to match.
=Change the initial or other specified letters to find more possible words.
I then repeat the searching & disambiguation phases.
This #regex technique may not find plausible candidates.
#Regex technique assumptions:
=Standardized spelling
=Era-appropriate word list or dictionary
=Case insensitive [configurable]
=Patterns: same letter is written in a recognizable style
=Discernible letter boundaries that permit a counting of the letters (or a number range).
Useful Resources: Words
A. Word list: Alphabetical [No numbers or symbols]
https://github.com/dwyl/english-words/blob/master/words_alpha.txt
[Lowercase]
B. Project Gutenberg: Webster's Unabridged Dictionary
https://www.gutenberg.org/ebooks/29765
[Upper- & lowercase]
Useful Resources: Regular Expressions
A. #Regex tutorials & guides
* https://regular-expressions.info (Jan Goyvaerts)
* https://www.rexegg.com
* https://ryanstutorials.net/regular-expressions-tutorial/
* https://riptutorial.com/regex
* http://regextutorials.com
B. Regex testing
* https://regex101.com
* https://regexr.com
END of thread