mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file
Creating study carrel named subject-philosophyChinese-gutenberg
Initializing database
Unzipping
Archive: input-file.zip
creating: ./tmp/input/input-file/
inflating: ./tmp/input/input-file/24055.txt
inflating: ./tmp/input/input-file/3330.txt
inflating: ./tmp/input/input-file/216.txt
inflating: ./tmp/input/input-file/metadata.csv
caution: excluded filename not matched: *MACOSX*
=== DIRECTORIES: ./tmp/input
=== DIRECTORY: ./tmp/input/input-file
=== metadata file: ./tmp/input/input-file/metadata.csv
=== found metadata file
=== updating bibliographic database
Building study carrel named subject-philosophyChinese-gutenberg
FILE: cache/24055.txt
OUTPUT: txt/24055.txt
FILE: cache/216.txt
OUTPUT: txt/216.txt
FILE: cache/3330.txt
OUTPUT: txt/3330.txt
=== file2bib.sh ===
id: 3330
author: Confucius
title: The Analects of Confucius (from the Chinese Classics)
date:
pages:
extension: .txt
txt: ./txt/3330.txt
cache: ./cache/3330.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 1
resourceName b'3330.txt'
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in
    text = textacy.preprocessing.normalize.normalize_quotation_marks( text )
  File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks
    return text.translate(QUOTE_TRANSLATION_TABLE)
AttributeError: 'NoneType' object has no attribute 'translate'
=== file2bib.sh ===
id: 24055
author: Confucius
title: The Sayings of Confucius
date:
pages:
extension: .txt
txt: ./txt/24055.txt
cache: ./cache/24055.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 1
resourceName b'24055.txt'
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in
    text = textacy.preprocessing.normalize.normalize_quotation_marks( text )
  File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks
    return text.translate(QUOTE_TRANSLATION_TABLE)
AttributeError: 'NoneType' object has no attribute 'translate'
24055 txt/../wrd/24055.wrd
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in
    for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) :
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake
    word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words)
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores
    freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw)
  File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean
    raise StatisticsError('mean requires at least one data point')
statistics.StatisticsError: mean requires at least one data point
3330 txt/../ent/3330.ent
3330 txt/../wrd/3330.wrd
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in
    for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) :
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake
    word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words)
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores
    freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw)
  File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean
    raise StatisticsError('mean requires at least one data point')
statistics.StatisticsError: mean requires at least one data point
24055 txt/../pos/24055.pos
3330 txt/../pos/3330.pos
24055 txt/../ent/24055.ent
216 txt/../wrd/216.wrd
216 txt/../pos/216.pos
=== file2bib.sh ===
id: 216
author: Laozi
title: The Tao Teh King, or the Tao and its Characteristics
date:
pages:
extension: .txt
txt: ./txt/216.txt
cache: ./cache/216.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 3
resourceName b'216.txt'
216 txt/../ent/216.ent
Done mapping.
Reducing subject-philosophyChinese-gutenberg
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
id = 216
author = Laozi
title = The Tao Teh King, or the Tao and its Characteristics
date =
pages =
extension = .txt
mime = text/plain
words = 10731
sentences = 914
flesch = 88
summary = When we can lay hold of the Tao of old to direct the things 1. When the Great Tao (Way or Method) ceased to be observed, am different from other men, but I value the nursing-mother (the Tao). 5. The relation of the Tao to all the world is like that of the great 3. Hence the sage is able (in the same way) to accomplish his great to) conduct (a government) according to the Great Tao, what I should 2. The great Tao (or way) is very level and easy; but people love the 4. The great state only wishes to unite men together and nourish them; Tao has of all things the most honoured place. previous state in which they were easy, and all great things from one 1. All the world says that, while my Tao is great, it yet appears 4.
Therefore the sage knows (these things) of himself, but does not
cache = ./cache/216.txt
txt = ./txt/216.txt
Building ./etc/reader.txt
216
3330
24055
216
3330
24055
number of items: 3
sum of words: 10,731
average size in words: 10,731
average readability score: 88
nouns: things; men; people; sage; state; place; one; way; world; death; life; name; nothing; man; others; knowledge; kingdom; purpose; end; attributes; action; words; skill; mind; excellence; thing; stillness; earth; day; arms; root; person; degree; course; war; use; strength; sky; sight; self; rest; princes; mother; law; kings; hold; hand; government; favour; faith
verbs: is; be; are; does; do; has; have; know; were; was; knows; keep; become; being; makes; make; called; seem; doing; see; having; did; come; seems; possessed; overcomes; had; am; act; said; made; give; found; call; takes; take; let; go; get; finds; carry; appeared; show; puts; named; lost; lose; look; hold; becomes
adjectives: great; other; skilful; small; own; good; highest; unchanging; sure; able; strong; old; easy; such; full; free; difficult; weak; soft; sincere; sharp; same; more; low; greatest; greater; firm; evil; deep; bright; whole; simple; mysterious; intelligent; empty; complete; poor; non; many; lower; long; left; few; female; different; beautiful; violent; utmost; true; swift
adverbs: not; therefore; so; thus; yet; always; still; long; only; more; away; also; up; out; now; most; hence; all; alone; very; then; again; together; soon; on; never; even; as; lightly; first; easily; there; instead; indeed; forth; far; equally; back; well; thereby; that; over; much; however; greatly; gradually; extensively; everywhere; ever; down
pronouns: it; he; his; its; them; their; they; i; we; him; himself; themselves; our; me; one; my; itself; you; your; us; ourselves; her; myself
proper nouns: tao; heaven; earth; way; great; lord; valley; tis; part; o''er; mother; men; yea; valleys; vacancy; twas; tse; tiger; they;--it; therewith; th; teh;
subtle; streams; stream);--it; son; so;--it; semblance; rhinoceros; quality; originator; oft; officers; obscurity; mysterious; music; misery!--happiness; method; meanness:--he; man;--he; man; loud; li; legge; lao; king; james; it:--he; image; ill;--
keywords: thing; tao; man; heaven; great
one topic; one dimension: tao
file(s):
titles(s): The Sayings of Confucius
three topics; one dimension: tao; yes; yes
file(s): ./cache/216.txt, ,
titles(s): The Tao Teh King, or the Tao and its Characteristics | The Sayings of Confucius | The Sayings of Confucius
five topics; three dimensions: tao things does; yes forlorn foes; yes forlorn foes; yes forlorn foes; yes forlorn foes
file(s): ./cache/216.txt, , , ,
titles(s): The Tao Teh King, or the Tao and its Characteristics | The Sayings of Confucius | The Sayings of Confucius | The Sayings of Confucius | The Sayings of Confucius
Type: gutenberg
title: subject-philosophyChinese-gutenberg
date: 2021-06-07
time: 14:06
username: emorgan
patron: Eric Morgan
email: emorgan@nd.edu
input: facet_subject:"Philosophy, Chinese"
==== make-pages.sh htm files
==== make-pages.sh complex files
==== make-pages.sh named enities
==== making bibliographics
id: 24055
author: Confucius
title: The Sayings of Confucius
date:
words: nan
sentences: nan
pages:
flesch: nan
cache:
txt:
summary:
id: 3330
author: Confucius
title: The Analects of Confucius (from the Chinese Classics)
date:
words: nan
sentences: nan
pages:
flesch: nan
cache:
txt:
summary:
id: 216
author: Laozi
title: The Tao Teh King, or the Tao and its Characteristics
date:
words: 10731.0
sentences: 914.0
pages:
flesch: 88.0
cache: ./cache/216.txt
txt: ./txt/216.txt
summary: When we can lay hold of the Tao of old to direct the things 1. When the Great Tao (Way or Method) ceased to be observed, am different from other men, but I value the nursing-mother (the Tao). 5. The relation of the Tao to all the world is like that of the great 3.
Hence the sage is able (in the same way) to accomplish his great to) conduct (a government) according to the Great Tao, what I should 2. The great Tao (or way) is very level and easy; but people love the 4. The great state only wishes to unite men together and nourish them; Tao has of all things the most honoured place. previous state in which they were easy, and all great things from one 1. All the world says that, while my Tao is great, it yet appears 4. Therefore the sage knows (these things) of himself, but does not
==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel