mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-indonesia-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/15685.txt inflating: ./tmp/input/input-file/16768.txt inflating: ./tmp/input/input-file/44705.txt inflating: ./tmp/input/input-file/60751.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-indonesia-gutenberg FILE: cache/44705.txt OUTPUT: txt/44705.txt FILE: cache/15685.txt OUTPUT: txt/15685.txt FILE: cache/60751.txt OUTPUT: txt/60751.txt FILE: cache/16768.txt OUTPUT: txt/16768.txt 44705 txt/../pos/44705.pos 44705 txt/../wrd/44705.wrd === file2bib.sh === id: 44705 author: Miller, Gerrit S. (Gerrit Smith) title: Mammals Collected by Dr. W. L. Abbott on the Natuna Islands Proceedings of the Washington Academy of Sciences, Vol. III, pp. 111-138 date: pages: extension: .txt txt: ./txt/44705.txt cache: ./cache/44705.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'44705.txt' 44705 txt/../ent/44705.ent 15685 txt/../pos/15685.pos 15685 txt/../wrd/15685.wrd 15685 txt/../ent/15685.ent === file2bib.sh === id: 15685 author: Dampier, William title: A Continuation of a Voyage to New Holland, Etc. 
in the Year 1699 date: pages: extension: .txt txt: ./txt/15685.txt cache: ./cache/15685.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'15685.txt' 16768 txt/../pos/16768.pos 16768 txt/../wrd/16768.wrd 16768 txt/../ent/16768.ent 60751 txt/../pos/60751.pos 60751 txt/../wrd/60751.wrd 60751 txt/../ent/60751.ent === file2bib.sh === id: 16768 author: Marsden, William title: The History of Sumatra Containing An Account Of The Government, Laws, Customs And Manners Of The Native Inhabitants date: pages: extension: .txt txt: ./txt/16768.txt cache: ./cache/16768.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 23 resourceName b'16768.txt' === file2bib.sh === id: 60751 author: Perelaer, M. T. H. (Michael Theophile Hubert) title: Baboe Dalima; or, The Opium Fiend date: pages: extension: .txt txt: ./txt/60751.txt cache: ./cache/60751.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 10 resourceName b'60751.txt' Done mapping. Reducing subject-indonesia-gutenberg === reduce.pl bib === id = 15685 author = Dampier, William title = A Continuation of a Voyage to New Holland, Etc. 
in the Year 1699 date = pages = extension = .txt mime = text/plain words = 45677 sentences = 2217 flesch = 85 summary = was a passage between the west end of Timor and another small island sandy island (over against the fort) full of bays and pretty high trees; the east or west of it; and near the shore it appeared like an island. us, we soon got abreast of the bay, and then saw a small island to the A DESCRIPTION OF A SMALL ISLAND, SEVEN LEAGUES EAST FROM THE WATERING BAY. At the south-west end of Timor is a pretty high island called Anabao. small flat island to the north-west of the others, and saw a great deal distance off at sea the west point appears like a cape land; the north long, and at the south-west point there is another small low woody island we were shot in within 2 leagues of the island the wind came to the west, sun-setting, I saw a small round high island to the west of Pentare, cache = ./cache/15685.txt txt = ./txt/15685.txt === reduce.pl bib === id = 16768 author = Marsden, William title = The History of Sumatra Containing An Account Of The Government, Laws, Customs And Manners Of The Native Inhabitants date = pages = extension = .txt mime = text/plain words = 199240 sentences = 9165 flesch = 68 summary = manuscript written about the year 1173, speaks of a large island called islands, says it obtains its appellation from a certain high land called Along the western coast of the island the low country, or space of land The personal difference between the Malays of the coast and the country resemble a small turban; the country people usually twisting a piece of in small quantities in different parts of the island, particularly in Many of the princes or chiefs in different parts of the island having the gold country, which points out the different places where they work different parts of the island, but chiefly near the sea-coast, and in the the country they inhabit is an island, or have any general name for it. 
people, but in time the coast became generally known by that of Tanah and at the same time great officers of state, who resided at places named cache = ./cache/16768.txt txt = ./txt/16768.txt === reduce.pl bib === id = 44705 author = Miller, Gerrit S. (Gerrit Smith) title = Mammals Collected by Dr. W. L. Abbott on the Natuna Islands Proceedings of the Washington Academy of Sciences, Vol. III, pp. 111-138 date = pages = extension = .txt mime = text/plain words = 10291 sentences = 1035 flesch = 75 summary = 17-19), Bunguran, or Great Natuna Island (June 24-July 31) and Pulo teeth distinctly worn, is smaller than in Bunguran specimens so young _Sciurus tenuis_ THOMAS and HARTERT, Novitates Zoologicæ, _Sciurus tenuis_ THOMAS and HARTERT, Novitates Zoologicæ, _Sciurus lowi_ THOMAS and HARTERT, Novitates Zoologicæ, _? Sciurus lowi natunensis_ THOMAS and HARTERT, Novitates _? Sciurus lowi natunensis_ THOMAS and HARTERT, Novitates _Skull._--As compared with the Bornean form of _Sciurus notatus_, the lutescens_ from Sirhassen Island, but upper parts slightly less pale, _Color._--Upper parts as in _Sciurus lutescens_ except that the _Sciurus notatus_ THOMAS and HARTERT, Novitates _Sciurus notatus_ THOMAS and HARTERT, Novitates _Sciurus notatus_ THOMAS and HARTERT, Novitates _Sciurus notatus_ THOMAS and HARTERT, Novitates Collected on Pulo Laut, North Natuna Islands, August 6, 1900. _Color._--Upper parts and tail as in _Sciurus lutescens_. colored Bunguran form, with which it more nearly agrees in size. size, color and external form, but skull with broader rostrum, and cache = ./cache/44705.txt txt = ./txt/44705.txt === reduce.pl bib === id = 60751 author = Perelaer, M. T. H. (Michael Theophile Hubert) title = Baboe Dalima; or, The Opium Fiend date = pages = extension = .txt mime = text/plain words = 224993 sentences = 13328 flesch = 82 summary = "I say," said Mrs. 
van Gulpendam, addressing her husband, "Dalima "Come, Dalima," said van Gulpendam, with some kindness in his voice, "Good evening, madam," said van Nerekool as he made his bow to the "But do you know for certain, Miss Anna," said van Nerekool, under Mr. van Nerekool," said Anna, "I really cannot tell you all "That is the man," replied van Nerekool, as he looked down anxiously "Oh so," said van Gulpendam, with a laugh, "the babah has come on Such was the state of things when Resident van Gulpendam gave Lim Yang young man's time at college, Mrs. van Nerekool died somewhat suddenly, "Yes, madam, I hear," said van Nerekool, drily, "I know that he did "Come, Charles," said Verstork, laying his hand on his friend's "Yes, my friend," said van Nerekool very sadly. "Don't look at things so darkly," said van Nerekool. cache = ./cache/60751.txt txt = ./txt/60751.txt Building ./etc/reader.txt 60751 16768 15685 60751 44705 16768 number of items: 4 sum of words: 480,201 average size in words: 120,050 average readability score: 77 nouns: time; man; opium; island; people; country; place; men; water; part; length; day; sea; girl; side; nothing; way; natives; hand; land; night; head; king; coast; name; parts; inhabitants; words; word; morning; north; kind; trees; father; feet; tree; species; islands; ground; course; west; state; years; number; house; person; year; fact; distance; hands verbs: is; was; had; be; have; are; were; been; said; has; do; being; made; found; did; cried; called; know; come; see; make; came; having; am; take; saw; say; continued; go; asked; get; tell; replied; give; taken; put; sent; think; took; began; let; told; brought; find; heard; seemed; used; stood; went; got adjectives: other; little; such; small; great; many; same; young; good; more; much; few; large; poor; first; high; own; long; general; several; considerable; white; full; last; certain; old; different; common; next; latter; whole; former; single; present; black; possible; fine; south; able; 
strong; dutch; least; dear; low; most; deep; right; greater; greatest; true adverbs: not; very; so; then; now; up; as; more; most; also; out; well; here; only; there; about; again; much; down; however; just; off; away; thus; even; all; still; soon; on; indeed; never; once; too; far; in; yet; quite; perhaps; always; enough; almost; sometimes; n''t; no; at; therefore; rather; ever; back; long pronouns: it; i; he; his; they; you; their; her; we; them; she; my; him; me; its; our; us; your; himself; themselves; itself; myself; herself; one; yourself; ourselves; mine; yours; theirs; ours; oneself; hers; m`bok; yourselves; thy; thee; spot--; purchase­money; ipu; d''ilhir proper nouns: van; _; mr.; anna; resident; gulpendam; nerekool; dalima; lim; verstork; sumatra; ho; ardjan; laurentia; grenits; santjoemeh; rheijn; charles; meidema; achin; island; india; mrs.; chinaman; javanese; kandjeng; java; beneden; malacca; pulo; malayan; dutch; new; europeans; yang; malays; company; bing; hut; portuguese; footnote; grashuis; toean; kaligaweh; government; east; batavia; william; nana; thomas keywords: portuguese; mr.; island; dutch; zuidhoorn; zoologicæ; yang; west; volume; verstork; transaction; timor; thomas; sungei; sumatrans; sumatra; st.; singomengolo; setrosmito; santjoemeh; river; rheijn; resident; raja; pulo; plate; pidir; person; people; pase; palembang; pahit; padang; novitates; new; nerekool; nana; murowski; mrs.; mode; moco; miss; menangkabau; meidema; marsden; marlborough; man; malays; malayan; malacca one topic; one dimension: said file(s): ./cache/15685.txt titles(s): A Continuation of a Voyage to New Holland, Etc. in the Year 1699 three topics; one dimension: van; country; island file(s): ./cache/60751.txt, ./cache/16768.txt, ./cache/15685.txt titles(s): Baboe Dalima; or, The Opium Fiend | The History of Sumatra Containing An Account Of The Government, Laws, Customs And Manners Of The Native Inhabitants | A Continuation of a Voyage to New Holland, Etc. 
in the Year 1699 five topics; three dimensions: van said opium; country people called; island west saw; iv 73 102; iv 73 102 file(s): ./cache/60751.txt, ./cache/16768.txt, ./cache/15685.txt, ./cache/44705.txt, ./cache/44705.txt titles(s): Baboe Dalima; or, The Opium Fiend | The History of Sumatra Containing An Account Of The Government, Laws, Customs And Manners Of The Native Inhabitants | A Continuation of a Voyage to New Holland, Etc. in the Year 1699 | Mammals Collected by Dr. W. L. Abbott on the Natuna Islands Proceedings of the Washington Academy of Sciences, Vol. III, pp. 111-138 | Mammals Collected by Dr. W. L. Abbott on the Natuna Islands Proceedings of the Washington Academy of Sciences, Vol. III, pp. 111-138 Type: gutenberg title: subject-indonesia-gutenberg date: 2021-06-06 time: 18:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Indonesia" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 15685 author: Dampier, William title: A Continuation of a Voyage to New Holland, Etc. in the Year 1699 date: words: 45677 sentences: 2217 pages: flesch: 85 cache: ./cache/15685.txt txt: ./txt/15685.txt summary: was a passage between the west end of Timor and another small island sandy island (over against the fort) full of bays and pretty high trees; the east or west of it; and near the shore it appeared like an island. us, we soon got abreast of the bay, and then saw a small island to the A DESCRIPTION OF A SMALL ISLAND, SEVEN LEAGUES EAST FROM THE WATERING BAY. At the south-west end of Timor is a pretty high island called Anabao. 
small flat island to the north-west of the others, and saw a great deal distance off at sea the west point appears like a cape land; the north long, and at the south-west point there is another small low woody island we were shot in within 2 leagues of the island the wind came to the west, sun-setting, I saw a small round high island to the west of Pentare, id: 16768 author: Marsden, William title: The History of Sumatra Containing An Account Of The Government, Laws, Customs And Manners Of The Native Inhabitants date: words: 199240 sentences: 9165 pages: flesch: 68 cache: ./cache/16768.txt txt: ./txt/16768.txt summary: manuscript written about the year 1173, speaks of a large island called islands, says it obtains its appellation from a certain high land called Along the western coast of the island the low country, or space of land The personal difference between the Malays of the coast and the country resemble a small turban; the country people usually twisting a piece of in small quantities in different parts of the island, particularly in Many of the princes or chiefs in different parts of the island having the gold country, which points out the different places where they work different parts of the island, but chiefly near the sea-coast, and in the the country they inhabit is an island, or have any general name for it. people, but in time the coast became generally known by that of Tanah and at the same time great officers of state, who resided at places named id: 44705 author: Miller, Gerrit S. (Gerrit Smith) title: Mammals Collected by Dr. W. L. Abbott on the Natuna Islands Proceedings of the Washington Academy of Sciences, Vol. III, pp. 
111-138 date: words: 10291 sentences: 1035 pages: flesch: 75 cache: ./cache/44705.txt txt: ./txt/44705.txt summary: 17-19), Bunguran, or Great Natuna Island (June 24-July 31) and Pulo teeth distinctly worn, is smaller than in Bunguran specimens so young _Sciurus tenuis_ THOMAS and HARTERT, Novitates Zoologicæ, _Sciurus tenuis_ THOMAS and HARTERT, Novitates Zoologicæ, _Sciurus lowi_ THOMAS and HARTERT, Novitates Zoologicæ, _? Sciurus lowi natunensis_ THOMAS and HARTERT, Novitates _? Sciurus lowi natunensis_ THOMAS and HARTERT, Novitates _Skull._--As compared with the Bornean form of _Sciurus notatus_, the lutescens_ from Sirhassen Island, but upper parts slightly less pale, _Color._--Upper parts as in _Sciurus lutescens_ except that the _Sciurus notatus_ THOMAS and HARTERT, Novitates _Sciurus notatus_ THOMAS and HARTERT, Novitates _Sciurus notatus_ THOMAS and HARTERT, Novitates _Sciurus notatus_ THOMAS and HARTERT, Novitates Collected on Pulo Laut, North Natuna Islands, August 6, 1900. _Color._--Upper parts and tail as in _Sciurus lutescens_. colored Bunguran form, with which it more nearly agrees in size. size, color and external form, but skull with broader rostrum, and id: 60751 author: Perelaer, M. T. H. (Michael Theophile Hubert) title: Baboe Dalima; or, The Opium Fiend date: words: 224993 sentences: 13328 pages: flesch: 82 cache: ./cache/60751.txt txt: ./txt/60751.txt summary: "I say," said Mrs. van Gulpendam, addressing her husband, "Dalima "Come, Dalima," said van Gulpendam, with some kindness in his voice, "Good evening, madam," said van Nerekool as he made his bow to the "But do you know for certain, Miss Anna," said van Nerekool, under Mr. van Nerekool," said Anna, "I really cannot tell you all "That is the man," replied van Nerekool, as he looked down anxiously "Oh so," said van Gulpendam, with a laugh, "the babah has come on Such was the state of things when Resident van Gulpendam gave Lim Yang young man''s time at college, Mrs. 
van Nerekool died somewhat suddenly, "Yes, madam, I hear," said van Nerekool, drily, "I know that he did "Come, Charles," said Verstork, laying his hand on his friend's "Yes, my friend," said van Nerekool very sadly. "Don't look at things so darkly," said van Nerekool.
==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel
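
The corpus-level figures reported in the log (number of items: 4; sum of words: 480,201; average size in words: 120,050; average readability score: 77) can be cross-checked against the per-item "reduce.pl bib" records. The following is a minimal sketch, not the pipeline's own code: the `items` dictionary and the `summarize` function are hypothetical names introduced here, and integer truncation of the averages is an assumption, chosen because the exact mean Flesch score is 77.5 while the log reports 77.

```python
# Per-item figures copied from the "reduce.pl bib" records in the log.
# The dictionary layout and function name are illustrative assumptions,
# not part of the pipeline that produced the log.
items = {
    "15685": {"words": 45677, "sentences": 2217, "flesch": 85},
    "16768": {"words": 199240, "sentences": 9165, "flesch": 68},
    "44705": {"words": 10291, "sentences": 1035, "flesch": 75},
    "60751": {"words": 224993, "sentences": 13328, "flesch": 82},
}


def summarize(records):
    """Recompute the corpus-level summary from per-item records."""
    count = len(records)
    total_words = sum(r["words"] for r in records.values())
    total_flesch = sum(r["flesch"] for r in records.values())
    return {
        "items": count,
        "total_words": total_words,
        # Assumption: averages are truncated to whole numbers.
        "mean_words": total_words // count,
        "mean_flesch": total_flesch // count,
    }


summary = summarize(items)
print(f"number of items: {summary['items']}")
print(f"sum of words: {summary['total_words']:,}")
print(f"average size in words: {summary['mean_words']:,}")
print(f"average readability score: {summary['mean_flesch']}")
```

Under the truncation assumption the recomputed values agree with the figures in the log; a different rounding rule could give the same readability result, so the match does not confirm how the pipeline actually rounds.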