mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-politicalRefugees-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/1135.txt inflating: ./tmp/input/input-file/1801.txt inflating: ./tmp/input/input-file/2235.txt inflating: ./tmp/input/input-file/47518.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-politicalRefugees-gutenberg FILE: cache/47518.txt OUTPUT: txt/47518.txt FILE: cache/1135.txt OUTPUT: txt/1135.txt FILE: cache/1801.txt OUTPUT: txt/1801.txt FILE: cache/2235.txt OUTPUT: txt/2235.txt === file2bib.sh === id: 2235 author: Shakespeare, William title: The Tempest date: pages: extension: .txt txt: ./txt/2235.txt cache: ./cache/2235.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'2235.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 1135 txt/../ent/1135.ent 2235 txt/../pos/2235.pos 2235 txt/../ent/2235.ent 1135 txt/../pos/1135.pos 1801 txt/../wrd/1801.wrd 1801 txt/../ent/1801.ent 1135 txt/../wrd/1135.wrd 2235 txt/../wrd/2235.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 1801 txt/../pos/1801.pos === file2bib.sh === id: 1135 author: Shakespeare, William title: The Tempest date: pages: extension: .txt txt: ./txt/1135.txt cache: ./cache/1135.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'1135.txt' === file2bib.sh === id: 1801 author: Shakespeare, William title: The Tempest date: pages: extension: .txt txt: ./txt/1801.txt cache: ./cache/1801.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'1801.txt' 47518 txt/../wrd/47518.wrd 47518 txt/../pos/47518.pos === file2bib.sh === id: 47518 author: Shakespeare, William title: Shakespeare's Comedy of The Tempest date: pages: extension: .txt txt: ./txt/47518.txt cache: ./cache/47518.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'47518.txt' 47518 txt/../ent/47518.ent Done mapping. Reducing subject-politicalRefugees-gutenberg === reduce.pl bib === id = 1135 author = Shakespeare, William title = The Tempest date = pages = extension = .txt mime = text/plain words = 40 sentences = 10 flesch = 88 summary = THIS EBOOK WAS ONE OF PROJECT GUTENBERG'S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#1540) at https://www.gutenberg.org/ebooks/1540 cache = ./cache/1135.txt txt = ./txt/1135.txt === reduce.pl bib === === reduce.pl bib === id = 1801 author = Shakespeare, William title = The Tempest date = pages = extension = .txt mime = text/plain words = 40 sentences = 10 flesch = 88 summary = THIS EBOOK WAS ONE OF PROJECT GUTENBERG'S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#1540) at https://www.gutenberg.org/ebooks/1540 cache = ./cache/1801.txt txt = ./txt/1801.txt === reduce.pl bib === id = 47518 author = Shakespeare, William title = Shakespeare's Comedy of The Tempest date = pages = extension = .txt mime = text/plain words = 18265 sentences = 2767 flesch = 99 summary = She said thou wast my daughter; and thy father Thou art inclined to sleep; 'tis a good dulness, Let me remember thee what thou hast promised, And left thee there; where thou didst vent thy groans What wert thou, if the King of Naples heard thee? Sea-water shalt thou drink; thy food shall be Can speak like us: then wisely, good sir, weigh Thou let'st thy fortune sleep--die, rather; wink'st Shall free thee from the tribute which thou payest; And I the king shall love thee. If thou beest Trinculo, come forth: I'll pull thee by the Drink, servant-monster, when I bid thee: thy eyes are almost set Moon-calf, speak once in thy life, if thou beest a good Thou shalt be lord of it and I'll serve thee. Give me thy hand: I am sorry I beat thee; but, while thou Were but my trials of thy love, and thou cache = ./cache/47518.txt txt = ./txt/47518.txt Building ./etc/reader.txt 47518 2235 1801 47518 2235 1801 number of items: 4 sum of words: 18,345 average size in words: 6,115 average readability score: 91 nouns: sir; monster; king; man; page; time; sea; father; island; art; thy; illustration; brother; son; master; thee; daughter; nothing; cell; spirit; men; thing; life; day; mine; eyes; business; spirits; ship; o; hither; fish; earth; way; music; heart; bottle; air; isle; hour; hand; dukedom; word; shalt; power; none; night; moon; mind; loss verbs: is; be; have; do; are; ''s; was; were; make; come; am; did; let; give; say; had; take; go; know; done; speak; hear; being; made; enter; bring; tell; put; find; does; set; keep; saw; look; think; pray; lies; swear; stand; see; remember; heard; follow; bear; came; believe; been; said; lost; live adjectives: good; more; such; strange; own; true; poor; brave; much; foul; best; mine; dear; sweet; other; great; full; invisible; fresh; new; little; free; awake; asleep; rich; old; noble; like; first; fair; long; liest; green; delicate; dead; certain; better; auspicious; very; several; many; last; human; gentle; fine; thee; strong; sour; solemn; same adverbs: not; now; so; here; most; then; as; again; too; there; more; well; else; yet; up; out; no; aside; never; off; ever; very; forth; indeed; on; even; thus; still; rather; down; hence; first; away; once; in; before; therefore; tis; much; farther; enough; by; almost; together; sometime; far; all; strangely; safely; only pronouns: i; my; you; me; it; your; his; he; they; him; thy; we; thee; their; her; our; them; us; she; ''em; mine; myself; thyself; ''s; himself; yourself; yours; thou; itself; its; themselves; ourselves; theirs; on''t; o; herself; hers; do''t proper nouns: _; thou; pros; seb; ant; steph; gon; mir; cal; ariel; ari; alon; prospero; fer; enter; stephano; naples; trinculo; milan; caliban; sir; lord; gonzalo; sebastian; hath; hast; exeunt; ferdinand; exit; boats; antonio; miranda; iris; alonso; tunis; re; o''er; juno; duke; dost; e''er; camest; king; thee; nymphs; didst; dido; ceres; canst; yea keywords: ebook; trin; steph; seb; pros; mir; gon; cal; ari; ant; alon one topic; one dimension: thou file(s): ./cache/1135.txt titles(s): The Tempest three topics; one dimension: thou; ebook; edition file(s): ./cache/47518.txt, ./cache/1135.txt, titles(s): Shakespeare''s Comedy of The Tempest | The Tempest | The Tempest five topics; three dimensions: thou pros thee; ebooks early developed; ebooks early developed; ebooks early developed; ebooks early developed file(s): ./cache/47518.txt, , , , titles(s): Shakespeare''s Comedy of The Tempest | The Tempest | The Tempest | The Tempest | The Tempest Type: gutenberg title: subject-politicalRefugees-gutenberg date: 2021-06-07 time: 14:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Political refugees" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 1135 author: Shakespeare, William title: The Tempest date: words: 40.0 sentences: 10.0 pages: flesch: 88.0 cache: ./cache/1135.txt txt: ./txt/1135.txt summary: THIS EBOOK WAS ONE OF PROJECT GUTENBERG''S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#1540) at https://www.gutenberg.org/ebooks/1540 id: 1801 author: Shakespeare, William title: The Tempest date: words: 40.0 sentences: 10.0 pages: flesch: 88.0 cache: ./cache/1801.txt txt: ./txt/1801.txt summary: THIS EBOOK WAS ONE OF PROJECT GUTENBERG''S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#1540) at https://www.gutenberg.org/ebooks/1540 id: 2235 author: Shakespeare, William title: The Tempest date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 47518 author: Shakespeare, William title: Shakespeare''s Comedy of The Tempest date: words: 18265.0 sentences: 2767.0 pages: flesch: 99.0 cache: ./cache/47518.txt txt: ./txt/47518.txt summary: She said thou wast my daughter; and thy father Thou art inclined to sleep; ''tis a good dulness, Let me remember thee what thou hast promised, And left thee there; where thou didst vent thy groans What wert thou, if the King of Naples heard thee? Sea-water shalt thou drink; thy food shall be Can speak like us: then wisely, good sir, weigh Thou let''st thy fortune sleep--die, rather; wink''st Shall free thee from the tribute which thou payest; And I the king shall love thee. If thou beest Trinculo, come forth: I''ll pull thee by the Drink, servant-monster, when I bid thee: thy eyes are almost set Moon-calf, speak once in thy life, if thou beest a good Thou shalt be lord of it and I''ll serve thee. Give me thy hand: I am sorry I beat thee; but, while thou Were but my trials of thy love, and thou ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel