mv: 'input-file.zip' and './input-file.zip' are the same file Creating study carrel named subject-chocolate-freebo Initializing database Unzipping Archive: input-file.zip inflating: ./tmp/input/A25542.xml inflating: ./tmp/input/xml2htm.xsl inflating: ./tmp/input/metadata.csv inflating: ./tmp/input/A19160.xml inflating: ./tmp/input/A36763.xml inflating: ./tmp/input/A61881.xml caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: === metadata file: ./tmp/input/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-chocolate-freebo May 24, 2021 4:49:07 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: J2KImageReader not loaded. JPEG2000 files will not be processed. See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io for optional dependencies. May 24, 2021 4:49:07 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: Tesseract OCR is installed and will be automatically applied to image files unless you've excluded the TesseractOCRParser from the default parser. Tesseract may dramatically slow down content extraction (TIKA-2359). As of Tika 1.15 (and prior versions), Tesseract is automatically called. In future versions of Tika, users may need to turn the TesseractOCRParser on via TikaConfig. May 24, 2021 4:49:07 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: org.xerial's sqlite-jdbc is not loaded. Please provide the jar on your classpath to parse sqlite files. See tika-parsers/pom.xml for the correct version. INFO Starting Apache Tika 1.24.1 server INFO Setting the server's publish address to be http://localhost:9998/ INFO Logging initialized @2194ms to org.eclipse.jetty.util.log.Slf4jLog INFO jetty-9.4.27.v20200227; built: 2020-02-27T18:37:21.340Z; git: a304fd9f351f337e7c0e2a7c28878dd536149c6c; jvm 1.8.0_281-b09 INFO Started ServerConnector@3e74829{HTTP/1.1, (http/1.1)}{localhost:9998} INFO Started @2262ms WARN Empty contextPath INFO Started o.e.j.s.h.ContextHandler@b4711e2{/,null,AVAILABLE} INFO Started Apache Tika server at http://localhost:9998/ INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) FILE: cache/A25542.xml OUTPUT: txt/A25542.txt FILE: cache/A19160.xml OUTPUT: txt/A19160.txt FILE: cache/A36763.xml OUTPUT: txt/A36763.txt FILE: cache/A61881.xml OUTPUT: txt/A61881.txt === file2bib.sh === INFO Detecting media type for Filename: b'A25542.xml' INFO Detecting media type for Filename: b'A19160.xml' INFO Detecting media type for Filename: b'A36763.xml' INFO rmeta/text (autodetecting type) INFO Detecting media type for Filename: b'A61881.xml' INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) A25542 txt/../pos/A25542.pos A25542 txt/../wrd/A25542.wrd A25542 txt/../ent/A25542.ent === file2bib.sh === id: A25542 author: England and Wales. Parliament. House of Commons. title: An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. date: nan pages: extension: .xml txt: ./txt/A25542.txt cache: ./cache/A25542.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 7 resourceName b'A25542.xml' A19160 txt/../pos/A19160.pos A19160 txt/../ent/A19160.ent A19160 txt/../wrd/A19160.wrd A36763 txt/../pos/A36763.pos A36763 txt/../ent/A36763.ent === file2bib.sh === id: A19160 author: Colmenero de Ledesma, Antonio. title: A curious treatise of the nature and quality of chocolate. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. And put into English by Don Diego de Vades-forte date: 1640.0 pages: extension: .xml txt: ./txt/A19160.txt cache: ./cache/A19160.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 61 resourceName b'A19160.xml' A36763 txt/../wrd/A36763.wrd A61881 txt/../pos/A61881.pos === file2bib.sh === id: A36763 author: Chamberlayne, John, 1666-1723. title: The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. date: 1685.0 pages: extension: .xml txt: ./txt/A36763.txt cache: ./cache/A36763.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 79 resourceName b'A36763.xml' A61881 txt/../ent/A61881.ent A61881 txt/../wrd/A61881.wrd === file2bib.sh === id: A61881 author: Stubbe, Henry, 1632-1676. title: The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... ; Thomas Gage, Survey of the West-Indies. chap. 15 ... date: 1662.0 pages: extension: .xml txt: ./txt/A61881.txt cache: ./cache/A61881.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 107 resourceName b'A61881.xml' Done mapping. Reducing subject-chocolate-freebo === reduce.pl bib === id = A25542 author = England and Wales. Parliament. House of Commons. title = An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. date = nan pages = extension = .xml mime = application/xml words = 1745 sentences = 354 flesch = 84 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). cache = ./cache/A25542.xml txt = ./txt/A25542.txt === reduce.pl bib === id = A61881 author = Stubbe, Henry, 1632-1676. title = The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... ; Thomas Gage, Survey of the West-Indies. chap. 15 ... date = 1662.0 pages = extension = .xml mime = application/xml words = 49802 sentences = 14394 flesch = 89 summary = The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... cache = ./cache/A61881.xml txt = ./txt/A61881.txt === reduce.pl bib === id = A19160 author = Colmenero de Ledesma, Antonio. title = A curious treatise of the nature and quality of chocolate. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. And put into English by Don Diego de Vades-forte date = 1640.0 pages = extension = .xml mime = application/xml words = 7941 sentences = 2200 flesch = 94 summary = Textual changes and metadata enrichments aim at making the text more computationally tractable, easier to read, and suitable for network-based collaborative curation by amateur and professional end users from many walks of life. This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. . The text can be copied, modified, distributed and performed, even for commercial purposes, all without asking permission. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. And Colmenero de Ledesma, Antonio 1640 8829 1 0 0 0 0 0 1 B The rate of 1 defects per 10,000 words puts this text in the B category of texts with fewer than 10 defects per 10,000 words. cache = ./cache/A19160.xml txt = ./txt/A19160.txt === reduce.pl bib === id = A36763 author = Chamberlayne, John, 1666-1723. title = The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. date = 1685.0 pages = extension = .xml mime = application/xml words = 20283 sentences = 5486 flesch = 94 summary = The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). cache = ./cache/A36763.xml txt = ./txt/A36763.txt Building ./etc/reader.txt A61881 A36763 A19160 A61881 A36763 A25542 number of items: 4 sum of words: 79,771 average size in words: 19,942 average readability score: 90 nouns: parts; water; others; use; way; time; nature; ingredients; body; self; reason; drink; quantity; part; taste; heat; thing; blood; stomach; t; quality; one; colour; substance; obstructions; fire; paste; ▪; effects; opinion; experience; seed; men; text; sort; nothing; mixture; nuts; nut; composition; things; pepper; fat; degree; meats; man; day; qualities; manner; fruit verbs: is; be; are; have; being; was; made; make; do; put; take; had; did; were; used; say; been; drink; having; has; called; found; taken; give; said; use; according; call; makes; see; hath; given; dissolved; concerning; milled; think; eat; set; done; mix; let; find; beaten; observed; am; taking; seems; seem; prepared; know adjectives: other; hot; great; cold; little; good; same; several; much; many; more; such; first; dry; third; drunk; true; last; long; particular; ordinary; red; own; different; second; certain; greater; best; large; indian; fat; excellent; better; small; least; general; former; simple; white; common; unctuous; most; like; whole; new; fine; old; natural; less; necessary adverbs: not; so; then; very; more; well; as; much; also; only; most; too; thereof; up; in; therefore; never; now; yet; thus; first; out; before; there; rather; here; together; otherwise; especially; already; sometimes; however; therein; even; away; long; almost; less; usually; often; again; alone; instead; better; ever; onely; once; moderately; all; easily pronouns: it; i; they; their; them; he; his; we; its; my; you; our; me; him; us; themselves; her; one; your; she; himself; thy; whereof; mine; theirs; thee; yours; thier; s; ours; ingender''d; f; em proper nouns: chocolata; cacao; de; y; chocolate; indies; ●; nut; stomach; que; c.; achiote; indians; pepper; sugar; la; el; spain; tcp; piso; hernandez; english; nature; hath; l.; maiz; tree; spanish; spaniards; iamaica; water; liver; 〉; lib; es; se; physicians; paste; gage; chap; ◊; drink; le; england; chocolatte; mexico; con; spirits; por; physick keywords: tcp; indies; cacao; stomach; ingredients; drink; chocolate; water; tree; tea; sugar; spirits; spice; spanish; spaniards; pound; piso; physicians; pepper; paste; obstructions; nut; nature; majesty; indians; hernandez; gage; degree; confection; chocolata; book; blood; achiote one topic; one dimension: chocolata file(s): ./cache/A36763.xml titles(s): The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. three topics; one dimension: chocolata; drink; calculation file(s): ./cache/A61881.xml, ./cache/A36763.xml, ./cache/A25542.xml titles(s): The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... ; Thomas Gage, Survey of the West-Indies. chap. 15 ... | The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. | An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. five topics; three dimensions: chocolata cacao hot; chocolate cacao use; coffee pence text; mens laid granted; mens laid granted file(s): ./cache/A61881.xml, ./cache/A36763.xml, ./cache/A25542.xml, ./cache/A25542.xml, ./cache/A25542.xml titles(s): The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... ; Thomas Gage, Survey of the West-Indies. chap. 15 ... | The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. | An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. | An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. | An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. Type: zip2carrel title: subject-chocolate-freebo date: 2021-05-24 time: 16:48 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: input-file.zip ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: A36763 author: Chamberlayne, John, 1666-1723. title: The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. date: 1685.0 words: 20283 sentences: 5486 pages: flesch: 94 cache: ./cache/A36763.xml txt: ./txt/A36763.txt summary: The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. The manner of making of coffee, tea, and chocolate as it is used in most parts of Europe, Asia, Africa, and America, with their vertues / newly done out of French and Spanish. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). id: A19160 author: Colmenero de Ledesma, Antonio. title: A curious treatise of the nature and quality of chocolate. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. And put into English by Don Diego de Vades-forte date: 1640.0 words: 7941 sentences: 2200 pages: flesch: 94 cache: ./cache/A19160.xml txt: ./txt/A19160.txt summary: Textual changes and metadata enrichments aim at making the text more computationally tractable, easier to read, and suitable for network-based collaborative curation by amateur and professional end users from many walks of life. This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. . The text can be copied, modified, distributed and performed, even for commercial purposes, all without asking permission. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. VVritten in Spanish by Antonio Colmenero, doctor in physicke and chirurgery. And Colmenero de Ledesma, Antonio 1640 8829 1 0 0 0 0 0 1 B The rate of 1 defects per 10,000 words puts this text in the B category of texts with fewer than 10 defects per 10,000 words. id: A25542 author: England and Wales. Parliament. House of Commons. title: An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. date: nan words: 1745 sentences: 354 pages: flesch: 84 cache: ./cache/A25542.xml txt: ./txt/A25542.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. An Answer to a paper set forth by the coffee-men directed to the Honourable, the Commons in Parliament assembled being reflections upon some propositions that were exhibited to the Parliament for the changing the excise of coffee, tea, and chocolate into a custom upon the commodities. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). id: A61881 author: Stubbe, Henry, 1632-1676. title: The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... ; Thomas Gage, Survey of the West-Indies. chap. 15 ... date: 1662.0 words: 49802 sentences: 14394 pages: flesch: 89 cache: ./cache/A61881.xml txt: ./txt/A61881.txt summary: The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... The Indian nectar, or, A discourse concerning chocolata the nature of cacao-nut and the other ingredients of that composition is examined and stated according to the judgment and experience of the Indian and Spanish writers ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... its effects as to its alimental and venereal quality as well as medicinal (especially in hypochondrial melancholy) are fully debated : together with a spagyrical analysis of the cacao-nut, performed by that excellent chymist Monsieur le Febure, chymist to His Majesty / by Henry Stubbe ... ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel