mv: 'input-file.zip' and './input-file.zip' are the same file Creating study carrel named subject-paperIndustry-freebo Initializing database Unzipping Archive: input-file.zip inflating: ./tmp/input/xml2htm.xsl inflating: ./tmp/input/metadata.csv inflating: ./tmp/input/A81258.xml inflating: ./tmp/input/A46574.xml caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: === metadata file: ./tmp/input/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-paperIndustry-freebo May 24, 2021 8:06:20 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: J2KImageReader not loaded. JPEG2000 files will not be processed. See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io for optional dependencies. May 24, 2021 8:06:20 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: Tesseract OCR is installed and will be automatically applied to image files unless you've excluded the TesseractOCRParser from the default parser. Tesseract may dramatically slow down content extraction (TIKA-2359). As of Tika 1.15 (and prior versions), Tesseract is automatically called. In future versions of Tika, users may need to turn the TesseractOCRParser on via TikaConfig. May 24, 2021 8:06:20 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: org.xerial's sqlite-jdbc is not loaded. Please provide the jar on your classpath to parse sqlite files. See tika-parsers/pom.xml for the correct version. INFO Starting Apache Tika 1.24.1 server INFO Setting the server's publish address to be http://localhost:9998/ INFO Logging initialized @1352ms to org.eclipse.jetty.util.log.Slf4jLog INFO jetty-9.4.27.v20200227; built: 2020-02-27T18:37:21.340Z; git: a304fd9f351f337e7c0e2a7c28878dd536149c6c; jvm 1.8.0_281-b09 INFO Started ServerConnector@3e74829{HTTP/1.1, (http/1.1)}{localhost:9998} INFO Started @1418ms WARN Empty contextPath INFO Started o.e.j.s.h.ContextHandler@62010f5c{/,null,AVAILABLE} INFO Started Apache Tika server at http://localhost:9998/ INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) FILE: cache/A46574.xml OUTPUT: txt/A46574.txt FILE: cache/A81258.xml OUTPUT: txt/A81258.txt === file2bib.sh === INFO Detecting media type for Filename: b'A46574.xml' INFO Detecting media type for Filename: b'A81258.xml' INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) A81258 txt/../pos/A81258.pos A46574 txt/../ent/A46574.ent A46574 txt/../wrd/A46574.wrd A81258 txt/../ent/A81258.ent A81258 txt/../wrd/A81258.wrd A46574 txt/../pos/A46574.pos === file2bib.sh === id: A46574 author: England and Wales. Sovereign (1685-1688 : James II) title: A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. date: 1687 pages: extension: .xml txt: ./txt/A46574.txt cache: ./cache/A46574.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 11 resourceName b'A46574.xml' === file2bib.sh === id: A81258 author: Company of White Paper Makers (London, England) title: The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The same being misrepresented in a paper stiled, The case of the Company of White-Paper-makers. date: 1699 pages: extension: .xml txt: ./txt/A81258.txt cache: ./cache/A81258.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 12 resourceName b'A81258.xml' Done mapping. Reducing subject-paperIndustry-freebo === reduce.pl bib === id = A46574 author = England and Wales. Sovereign (1685-1688 : James II) title = A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. date = 1687 pages = extension = .xml mime = application/xml words = 1469 sentences = 238 flesch = 78 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. Printed by Charles Bill, Henry Hills, and Thomas Newcomb ..., At end of text: Given at our court at Whitehall the nine and twentieth day of April, 1687. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). cache = ./cache/A46574.xml txt = ./txt/A46574.txt === reduce.pl bib === id = A81258 author = Company of White Paper Makers (London, England) title = The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The same being misrepresented in a paper stiled, The case of the Company of White-Paper-makers. date = 1699 pages = extension = .xml mime = application/xml words = 2190 sentences = 393 flesch = 82 summary = The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). Company of White Paper Makers (London, England). -Case of the Company of White-Paper-Makers -Early works to 1800. cache = ./cache/A81258.xml txt = ./txt/A81258.txt Building ./etc/reader.txt A81258 A46574 A81258 A46574 number of items: 2 sum of words: 3,659 average size in words: 1,829 average readability score: 80 nouns: text; paper; making; texts; characters; makers; xml; works; work; image; books; edition; case; title; project; page; keying; images; encoding; elements; eebo; data; time; purposes; materials; encouragement; users; sets; selection; schema; quantities; mills; men; markup; instances; guidelines; editions; writing; sheet; sellers; reasons; reason; quantity; proclamation; prizes; passing; others; number; manufacture; establishing verbs: is; be; have; was; are; were; been; encoded; said; made; do; make; intended; being; making; based; given; set; sent; represented; published; marked; imployed; created; create; corrected; brought; -; using; take; stiled; stated; performed; pay; offered; misrepresented; means; intituled; had; establishing; appears; according; use; understanding; transformed; transcribed; taken; simplify; served; scanned adjectives: such; other; early; same; english; great; general; available; greater; white; small; many; illegible; good; first; better; twentieth; sufficient; sole; several; second; present; possible; due; disabled; wide; void; vast; utmost; usual; true; textual; syntactic; subject; structural; readable; quality; public; own; overall; original; more; monographic; lossless; like; light; later; last; large; keyboarded adverbs: not; very; then; therefore; now; better; also; online; humbly; truly; so; most; in; hereby; here; as; whatsoever; well; variously; usually; thereof; sometimes; respectfully; over; out; notably; never; mainly; even; early; away; already; accurately; abroad; above; yearly; up; thereupon; thereby; strictly; otherwise; only; more; lately; instead; hereafter; forth; expresly; evidently; duly pronouns: their; our; it; they; we; them; i; his; your; themselves; its; him; he proper nouns: paper; tcp; white; england; english; company; text; tei; eebo; act; oxford; makers; james; proquest; phase; partnership; london; creation; trade; mills; law; utf-8; unicode; transcribed; printing; parliament; p5; online; new; ncbel; monopoly; michigan; manufacture; kingdom; king; governor; brown; writing; wing; royal; r.; persons; lincoln; ii; grant; corrupt; charge; bill; withdraw; whitehall keywords: tcp; paper; england one topic; one dimension: paper file(s): ./cache/A81258.xml titles(s): The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The same being misrepresented in a paper stiled, The case of the Company of White-Paper-makers. three topics; one dimension: paper; lately; lately file(s): ./cache/A81258.xml, ./cache/A46574.xml, ./cache/A46574.xml titles(s): The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The same being misrepresented in a paper stiled, The case of the Company of White-Paper-makers. | A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. | A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. five topics; three dimensions: paper making white; text tcp eebo; publisher creative database; publisher creative database; publisher creative database file(s): ./cache/A81258.xml, ./cache/A46574.xml, ./cache/A46574.xml, ./cache/A46574.xml, ./cache/A46574.xml titles(s): The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The same being misrepresented in a paper stiled, The case of the Company of White-Paper-makers. | A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. | A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. | A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. | A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. Type: zip2carrel title: subject-paperIndustry-freebo date: 2021-05-24 time: 19:54 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: input-file.zip ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: A81258 author: Company of White Paper Makers (London, England) title: The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The same being misrepresented in a paper stiled, The case of the Company of White-Paper-makers. date: 1699 words: 2190 sentences: 393 pages: flesch: 82 cache: ./cache/A81258.xml txt: ./txt/A81258.txt summary: The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. The case and circumstances of paper-making in England truly stated And by the paper-sellers humbly offered to the consideration of this present Parliament, as reasons against the passing of a bill, intituled An act for the encouragement and better establishing the making of white-writing and printing-paper. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). Company of White Paper Makers (London, England). -Case of the Company of White-Paper-Makers -Early works to 1800. id: A46574 author: England and Wales. Sovereign (1685-1688 : James II) title: A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. date: 1687 words: 1469 sentences: 238 pages: flesch: 78 cache: ./cache/A46574.xml txt: ./txt/A46574.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. A proclamation for the encouraging and better establishing of the manufacture of white paper in England James R. Printed by Charles Bill, Henry Hills, and Thomas Newcomb ..., At end of text: Given at our court at Whitehall the nine and twentieth day of April, 1687. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel