mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file
Creating study carrel named subject-corsicaFrance-gutenberg
Initializing database
Unzipping
Archive: input-file.zip
creating: ./tmp/input/input-file/
inflating: ./tmp/input/input-file/15565.txt
inflating: ./tmp/input/input-file/20263.txt
inflating: ./tmp/input/input-file/19882.txt
inflating: ./tmp/input/input-file/41881.txt
inflating: ./tmp/input/input-file/metadata.csv
caution: excluded filename not matched: *MACOSX*
=== DIRECTORIES: ./tmp/input
=== DIRECTORY: ./tmp/input/input-file
=== metadata file: ./tmp/input/input-file/metadata.csv
=== found metadata file
=== updating bibliographic database
Building study carrel named subject-corsicaFrance-gutenberg
FILE: cache/41881.txt OUTPUT: txt/41881.txt
FILE: cache/19882.txt OUTPUT: txt/19882.txt
FILE: cache/20263.txt OUTPUT: txt/20263.txt
FILE: cache/15565.txt OUTPUT: txt/15565.txt
19882 txt/../wrd/19882.wrd
19882 txt/../pos/19882.pos
41881 txt/../pos/41881.pos
19882 txt/../ent/19882.ent
41881 txt/../wrd/41881.wrd
=== file2bib.sh ===
id: 19882
author: Black, C. B.
title: Itinerary through Corsica by its Rail, Carriage & Forest Roads
date:
pages:
extension: .txt
txt: ./txt/19882.txt
cache: ./cache/19882.txt
Content-Encoding UTF-8
Content-Type text/plain; charset=UTF-8
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 5
resourceName b'19882.txt'
41881 txt/../ent/41881.ent
=== file2bib.sh ===
id: 41881
author: Dumas, Alexandre
title: The Corsican Brothers
date:
pages:
extension: .txt
txt: ./txt/41881.txt
cache: ./cache/41881.txt
Content-Encoding UTF-8
Content-Type text/plain; charset=UTF-8
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 4
resourceName b'41881.txt'
20263 txt/../wrd/20263.wrd
20263 txt/../pos/20263.pos
20263 txt/../ent/20263.ent
15565 txt/../pos/15565.pos
15565 txt/../wrd/15565.wrd
=== file2bib.sh ===
id: 20263
author: Boswell, James
title: Boswell's Correspondence with the Honourable Andrew Erskine, and His Journal of a Tour to Corsica
date:
pages:
extension: .txt
txt: ./txt/20263.txt
cache: ./cache/20263.txt
Content-Encoding UTF-8
Content-Type text/plain; charset=UTF-8
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 5
resourceName b'20263.txt'
15565 txt/../ent/15565.ent
=== file2bib.sh ===
id: 15565
author: Quiller-Couch, Arthur
title: Sir John Constantine Memoirs of His Adventures At Home and Abroad and Particularly in the Island of Corsica: Beginning with the Year 1756
date:
pages:
extension: .txt
txt: ./txt/15565.txt
cache: ./cache/15565.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser',
'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 6
resourceName b'15565.txt'
Done mapping.
Reducing subject-corsicaFrance-gutenberg
=== reduce.pl bib ===
id = 15565
author = Quiller-Couch, Arthur
title = Sir John Constantine Memoirs of His Adventures At Home and Abroad and Particularly in the Island of Corsica: Beginning with the Year 1756
date =
pages =
extension = .txt
mime = text/plain
words = 134380
sentences = 8448
flesch = 87
summary = The rogue said it, as I knew, to turn my father's suspicion, having So he began to tell it bead by bead, when a voice near at hand said "You are welcome, Sir John," said the prisoner, as my father faced "By the way, Nat," said I, looking up as I turned the page, "Your father, sir," said he, looking about him and seeming to sniff My uncle came forward, still like a man in a dream, mounted the dais "I thank you, sir," said the grey-headed preacher, stepping forward My father laid a hand on the old man's shoulder. "And this old man's sires," said my father to me, but so that he did "I believe you are right," said my father, after a long look to "Man," said Captain Pomery, looking about him, "we must be a poor set The Princess held it, and at Nat's head and feet stood Marc'antonio
cache = ./cache/15565.txt
txt = ./txt/15565.txt
=== reduce.pl bib ===
id = 20263
author = Boswell, James
title = Boswell's Correspondence with the Honourable Andrew Erskine, and His Journal of a Tour to Corsica
date =
pages =
extension = .txt
mime = text/plain
words = 65264
sentences = 4012
flesch = 79
summary = Boswell writes to tell his friend Temple--"I have hopes that our Johnson draws between Boswell's Account of Corsica, which forms more volume of letters that passed between Boswell and his friend The _great man_ now," he writes to his friend Temple.
of letters, his book on Corsica brought him far greater pleasure than Boswell, I shall not praise your letter, because I know you have [Footnote 22: Boswell in a letter to his friend Temple, dated May 1st Dear ERSKINE,--What sort of a letter shall I now write to you? He said his great object was to form the Corsicans in such a manner that country." Then turning to the man, "Sir," said he, "Corsica makes it a [Footnote 125: "On the evening of October 10, 1769, I presented Dr. Johnson to General Paoli. [Footnote 130: "'Sir,' said Johnson, 'I am a friend to subordination, as
cache = ./cache/20263.txt
txt = ./txt/20263.txt
=== reduce.pl bib ===
id = 19882
author = Black, C. B.
title = Itinerary through Corsica by its Rail, Carriage & Forest Roads
date =
pages =
extension = .txt
mime = text/plain
words = 22981
sentences = 1908
flesch = 84
summary = CORSICA, its Rail, Carriage and Forest Roads, with 6 Maps from the important of the Forest roads extends S.W. to Porto by and Porto great Forest roads penetrate into the interior 16 railway and the road between Ajaccio and Corté near Vivario 27 The road now from Corté to Bastia traverses the Quilico Col, 1932 ft., the Francardo bridge, 856 ft., where it meets the great Forest Road from Junction here with road to Corté, 55½ miles, south-east, passing through miles eastwards to the Col Capronale, 4495 feet, in the forest of Ometa. the road having passed over the Col Staggiola, 930 feet, within a short valley of the Asco, with magnificent forest trees, to the village of the Col Cesario, 1200 ft., 10½ m.; the villages of Feliceto, inn, pop. Five miles beyond Cauro, the Sartène road attains the summit of the Col S.W. This forest road, No.
4, ascends the valley of the
cache = ./cache/19882.txt
txt = ./txt/19882.txt
=== reduce.pl bib ===
id = 41881
author = Dumas, Alexandre
title = The Corsican Brothers
date =
pages =
extension = .txt
mime = text/plain
words = 30638
sentences = 2288
flesch = 86
summary = He came to announce that his young master, Monsieur Lucien de Franchi, "Now," said I to Lucien, who all this time had been dressing, "let us "I am afraid that we have kept you waiting, mother," said Lucien; "I Madame de Franchi smiled sadly, and said, "The absent are in the hands "Lucien," said Madame de Franchi, "remember you speak for yourself. "Mother," said Lucien as he rose, "you will excuse our leaving you, "My dear, sir," said Lucien, "Orlandi wishes to shake you by the hand, "Yes," said Lucien, "I confess that I should." "Bah!" said Orlandi; "why, with a carbine, Monsieur Lucien could hit a "Come along," said Lucien, "if you wish to see the play. "Well, my dear Monsieur Orlandi," I said, extending my hand, "I wish "Thank you, monsieur," said Louis to D----, holding out his hand to "Not yet, monsieur," said the young unknown addressing Chateau Renaud,
cache = ./cache/41881.txt
txt = ./txt/41881.txt
Building ./etc/reader.txt
15565 20263 41881 15565 41881 20263
number of items: 4
sum of words: 253,263
average size in words: 63,315
average readability score: 84
nouns: man; father; time; m.; day; way; road; men; hand; head; eyes; house; moment; letter; island; brother; sir; friend; years; night; miles; face; nothing; hands; sea; part; ft; country; one; life; side; feet; people; p.; place; voice; door; uncle; woman; name; days; room; mind; water; world; village; words; forest; word; minutes
verbs: was; is; had; have; be; said; are; were; has; been; do; am; did; see; made; say; came; come; tell; know; being; take; found; took; make; having; let; saw; heard; think; turned; told; stood; put; left; go; went; find; answered; give; believe; ''s; called; asked; seemed; gave; lay; brought; replied; remember
adjectives: good; great; little; other; own; more; first; same; last; old; young; many; much; few; such; dear; next; long; poor; full; short; high; corsican; best; small; whole; sure; second; least; better; fine; very; large; dead; open; true; less; able; wild; white; most; beautiful; strange; present; new; french; certain; tall; sudden; noble
adverbs: not; so; up; then; very; now; here; out; down; only; again; never; even; most; more; too; back; n''t; well; yet; there; as; still; off; also; ever; away; once; just; on; indeed; almost; much; far; all; in; first; however; no; together; rather; perhaps; over; soon; long; enough; therefore; about; quite; always
pronouns: i; he; you; it; my; his; me; him; we; her; they; your; she; their; them; us; our; its; himself; myself; themselves; itself; herself; ''em; yourself; ourselves; one; yours; mine; thy; thee; em; ours; theirs; je; hers; yt; yourselves; ye; whereof; whence; thyself; sure--; midmost; iv; io; idées; hisself; hey; hand--"you
proper nouns: _; mr.; corsica; de; princess; sir; .; paoli; boswell; billy; marc''antonio; john; lucien; corsicans; footnote; genoese; col; m.; louis; badcock; fett; nat; prince; god; stephanu; king; ft; ajaccio; captain; johnson; bastia; la; france; chateau; general; franchi; lord; renaud; priske; genoa; prosper; s.; london; pomery; england; orlandi; james; sartène; i.; theodore
keywords: corsica; sir; mr.; man; king; genoese; france; corsicans; theodore; stephanu; signor; sartène; rousseau; renaud; queen; prosper; priske; princess; pomery; paris; paoli; orlandi; ode; nat; napoleon; mount; monte; monsieur; mayor; lucien; louis; lord; london; letter; lady; johnson; john; james; headnote; great; good; god; gervase; genoa; general; gauntlet; french; franchi; footnote; fiennes
one topic; one dimension: said
file(s): ./cache/15565.txt
titles(s): Sir John Constantine Memoirs of His Adventures At Home and Abroad and Particularly in the Island of Corsica: Beginning with the Year 1756
three
topics; one dimension: said; ft; recording
file(s): ./cache/15565.txt, ./cache/19882.txt, ./cache/41881.txt
titles(s): Sir John Constantine Memoirs of His Adventures At Home and Abroad and Particularly in the Island of Corsica: Beginning with the Year 1756 | Itinerary through Corsica by its Rail, Carriage & Forest Roads | The Corsican Brothers
five topics; three dimensions: said man father; ft road col; said lucien yes; 1807 abundance strikes; 1807 abundance strikes
file(s): ./cache/15565.txt, ./cache/19882.txt, ./cache/41881.txt, ./cache/41881.txt, ./cache/41881.txt
titles(s): Sir John Constantine Memoirs of His Adventures At Home and Abroad and Particularly in the Island of Corsica: Beginning with the Year 1756 | Itinerary through Corsica by its Rail, Carriage & Forest Roads | The Corsican Brothers | The Corsican Brothers | The Corsican Brothers
Type: gutenberg
title: subject-corsicaFrance-gutenberg
date: 2021-06-03
time: 19:06
username: emorgan
patron: Eric Morgan
email: emorgan@nd.edu
input: facet_subject:"Corsica (France)"
==== make-pages.sh htm files
==== make-pages.sh complex files
==== make-pages.sh named enities
==== making bibliographics
id: 19882
author: Black, C. B.
title: Itinerary through Corsica by its Rail, Carriage & Forest Roads
date:
words: 22981
sentences: 1908
pages:
flesch: 84
cache: ./cache/19882.txt
txt: ./txt/19882.txt
summary: CORSICA, its Rail, Carriage and Forest Roads, with 6 Maps from the important of the Forest roads extends S.W. to Porto by and Porto great Forest roads penetrate into the interior 16 railway and the road between Ajaccio and Corté near Vivario 27 The road now from Corté to Bastia traverses the Quilico Col, 1932 ft., the Francardo bridge, 856 ft., where it meets the great Forest Road from Junction here with road to Corté, 55½ miles, south-east, passing through miles eastwards to the Col Capronale, 4495 feet, in the forest of Ometa.
the road having passed over the Col Staggiola, 930 feet, within a short valley of the Asco, with magnificent forest trees, to the village of the Col Cesario, 1200 ft., 10½ m.; the villages of Feliceto, inn, pop. Five miles beyond Cauro, the Sartène road attains the summit of the Col S.W. This forest road, No. 4, ascends the valley of the
id: 20263
author: Boswell, James
title: Boswell''s Correspondence with the Honourable Andrew Erskine, and His Journal of a Tour to Corsica
date:
words: 65264
sentences: 4012
pages:
flesch: 79
cache: ./cache/20263.txt
txt: ./txt/20263.txt
summary: Boswell writes to tell his friend Temple--"I have hopes that our Johnson draws between Boswell''s Account of Corsica, which forms more volume of letters that passed between Boswell and his friend The _great man_ now," he writes to his friend Temple. of letters, his book on Corsica brought him far greater pleasure than Boswell, I shall not praise your letter, because I know you have [Footnote 22: Boswell in a letter to his friend Temple, dated May 1st Dear ERSKINE,--What sort of a letter shall I now write to you? He said his great object was to form the Corsicans in such a manner that country." Then turning to the man, "Sir," said he, "Corsica makes it a [Footnote 125: "On the evening of October 10, 1769, I presented Dr. Johnson to General Paoli. [Footnote 130: "''Sir,'' said Johnson, ''I am a friend to subordination, as
id: 41881
author: Dumas, Alexandre
title: The Corsican Brothers
date:
words: 30638
sentences: 2288
pages:
flesch: 86
cache: ./cache/41881.txt
txt: ./txt/41881.txt
summary: He came to announce that his young master, Monsieur Lucien de Franchi, "Now," said I to Lucien, who all this time had been dressing, "let us "I am afraid that we have kept you waiting, mother," said Lucien; "I Madame de Franchi smiled sadly, and said, "The absent are in the hands "Lucien," said Madame de Franchi, "remember you speak for yourself. "Mother," said Lucien as he rose, "you will excuse our leaving you, "My dear, sir," said Lucien, "Orlandi wishes to shake you by the hand, "Yes," said Lucien, "I confess that I should." "Bah!" said Orlandi; "why, with a carbine, Monsieur Lucien could hit a "Come along," said Lucien, "if you wish to see the play. "Well, my dear Monsieur Orlandi," I said, extending my hand, "I wish "Thank you, monsieur," said Louis to D----, holding out his hand to "Not yet, monsieur," said the young unknown addressing Chateau Renaud,
id: 15565
author: Quiller-Couch, Arthur
title: Sir John Constantine Memoirs of His Adventures At Home and Abroad and Particularly in the Island of Corsica: Beginning with the Year 1756
date:
words: 134380
sentences: 8448
pages:
flesch: 87
cache: ./cache/15565.txt
txt: ./txt/15565.txt
summary: The rogue said it, as I knew, to turn my father''s suspicion, having So he began to tell it bead by bead, when a voice near at hand said "You are welcome, Sir John," said the prisoner, as my father faced "By the way, Nat," said I, looking up as I turned the page, "Your father, sir," said he, looking about him and seeming to sniff My uncle came forward, still like a man in a dream, mounted the dais "I thank you, sir," said the grey-headed preacher, stepping forward My father laid a hand on the old man''s shoulder. "And this old man''s sires," said my father to me, but so that he did "I believe you are right," said my father, after a long look to "Man," said Captain Pomery, looking about him, "we must be a poor set The Princess held it, and at Nat''s head and feet stood Marc''antonio
==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel