mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-fisheries-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/17171.txt inflating: ./tmp/input/input-file/26560.txt inflating: ./tmp/input/input-file/26632.txt inflating: ./tmp/input/input-file/24808.txt inflating: ./tmp/input/input-file/15035.txt inflating: ./tmp/input/input-file/43856.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-fisheries-gutenberg FILE: cache/24808.txt OUTPUT: txt/24808.txt FILE: cache/17171.txt OUTPUT: txt/17171.txt FILE: cache/26632.txt OUTPUT: txt/26632.txt FILE: cache/26560.txt OUTPUT: txt/26560.txt FILE: cache/15035.txt OUTPUT: txt/15035.txt FILE: cache/43856.txt OUTPUT: txt/43856.txt === file2bib.sh === id: 24808 author: Wood, William title: All Afloat: A Chronicle of Craft and Waterways date: pages: extension: .txt txt: ./txt/24808.txt cache: ./cache/24808.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'24808.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 24808 txt/../pos/24808.pos 24808 txt/../ent/24808.ent 24808 txt/../wrd/24808.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 17171 txt/../pos/17171.pos 17171 txt/../wrd/17171.wrd 17171 txt/../ent/17171.ent 26632 txt/../pos/26632.pos === file2bib.sh === id: 17171 author: Various title: New England Salmon Hatcheries and Salmon Fisheries in the Late 19th Century date: pages: extension: .txt txt: ./txt/17171.txt cache: ./cache/17171.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'17171.txt' 26632 txt/../wrd/26632.wrd 26632 txt/../ent/26632.ent 15035 txt/../wrd/15035.wrd 15035 txt/../pos/15035.pos === file2bib.sh === id: 26632 author: Wharton, James title: The Bounty of the Chesapeake: Fishing in Colonial Virginia date: pages: extension: .txt txt: ./txt/26632.txt cache: ./cache/26632.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'26632.txt' 15035 txt/../ent/15035.ent 43856 txt/../wrd/43856.wrd 26560 txt/../wrd/26560.wrd 43856 txt/../pos/43856.pos 26560 txt/../pos/26560.pos 43856 txt/../ent/43856.ent === file2bib.sh === id: 15035 author: Rich, Walter H. (Walter Herbert) title: Fishing Grounds of the Gulf of Maine date: pages: extension: .txt txt: ./txt/15035.txt cache: ./cache/15035.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 6 resourceName b'15035.txt' 26560 txt/../ent/26560.ent === file2bib.sh === id: 43856 author: Ely, Wilmer M. (Wilmer Mateo) title: The Boy Chums Cruising in Florida Waters or, The Perils and Dangers of the Fishing Fleet date: pages: extension: .txt txt: ./txt/43856.txt cache: ./cache/43856.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'43856.txt' === file2bib.sh === id: 26560 author: Tolman, Albert Walter title: Jim Spurling, Fisherman or Making Good date: pages: extension: .txt txt: ./txt/26560.txt cache: ./cache/26560.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'26560.txt' Done mapping. Reducing subject-fisheries-gutenberg === reduce.pl bib === id = 17171 author = Various title = New England Salmon Hatcheries and Salmon Fisheries in the Late 19th Century date = pages = extension = .txt mime = text/plain words = 17306 sentences = 850 flesch = 72 summary = of the fishery this year is the great numbers of young salmon caught, S.--Kennebec salmon caught to-day in the Hudson River at Bath near much better food-fish than the salmon. years the number of salmon has largely increased, due mainly, no doubt, salmon fisheries in the following rivers, namely, the Penobscot, the inclosure made in Dead Brook, and a stock of breeding salmon placed the salmon at the ordinary fishing season, May, June, and July, and keep The salmon placed in this inclosure had to be carted in tanks of water salmon, after it has left the fresh-water rivers in which it spawns and fish are taken throughout the entire pound-net season, but are most region, a great many salmon were being taken in the pound nets. In 1893, 3 fish were taken, as follows: May 10, a salmon weighing 19 1893, 2 salmon weighing 10 or 12 pounds each were taken at that place. cache = ./cache/17171.txt txt = ./txt/17171.txt === reduce.pl bib === id = 26632 author = Wharton, James title = The Bounty of the Chesapeake: Fishing in Colonial Virginia date = pages = extension = .txt mime = text/plain words = 30458 sentences = 1628 flesch = 76 summary = _The Bounty of the Chesapeake; Fishing in Colonial Virginia._ By small rivers all the year there is a good plenty of small fish, so James river, the best waters for sturgeon in Virginia to this day. fish named by Colonial reporters are to be found in Virginia waters There are many more varieties of fish caught by Virginia fishermen expect at time of year to have a good fishing for cod, as both at of salt, fish, and profits of the land shall be for the tenants, conveyed quantities of salt fish to the Colony from Canada on his ship Colony in Virginia and that fish is worth not less than £600. time for fishing, that the salt or pickle would not keep them as in remain today among Virginia's most plentiful fish but the salting The fishermen of Virginia needed salt for their fish as _The Fish and Fisheries of Colonial Virginia._ In cache = ./cache/26632.txt txt = ./txt/26632.txt === reduce.pl bib === id = 15035 author = Rich, Walter H. (Walter Herbert) title = Fishing Grounds of the Gulf of Maine date = pages = extension = .txt mime = text/plain words = 44544 sentences = 3289 flesch = 88 summary = fishes--the cod, haddock, cusk, hake, pollock, and halibut--and each western shore of Nova Scotia is virtually all fishing ground for cod, fathoms on the shoal ground running from 5 miles from Gull Rock and the Rips furnish good cod and haddock fishing for the entire year, with hake ground from this point south to the Lurcher Shoal furnishes good fishing Island is all good ground in summer for cod and for pollock, also, when Principally Maine vessels fish this ground, using hand line and trawl. comparatively small ground, but it furnishes good cod fishing in the This is a cod and haddock ground at seasons when these fish are in in spring and fall and a haddock ground in winter and is fished by Principally a summer small-boat ground fished by hand lines, trawls, and Pollock Hub 3 miles) is a fishing ground for haddock in January and cache = ./cache/15035.txt txt = ./txt/15035.txt === reduce.pl bib === === reduce.pl bib === id = 43856 author = Ely, Wilmer M. (Wilmer Mateo) title = The Boy Chums Cruising in Florida Waters or, The Perils and Dangers of the Fishing Fleet date = pages = extension = .txt mime = text/plain words = 67541 sentences = 5054 flesch = 92 summary = "Both my chum and I would like to learn how to run the engine," Charley "Something that will not bear the light of day, I guess," said Charley, "Now look here, Hunter," Charley said coolly, "you fellows objected to "I never mentioned last night," said Charley, quickly, and Hunter Walter and the captain hurried to Charley and helped him up from the start out for real work," said Charley, cheerfully, ignoring his chum's Well, Charley and the captain would never want him to fish with them way home," Charley said as soon as he got the engine started. It still lacked an hour to time to go fishing and Charley lay down on He and Walter changed places, and while Charley picked out the fish "I believe the wind is going down a little," Charley said, shortly Followed by the captain and Chris, Charley headed for the little cache = ./cache/43856.txt txt = ./txt/43856.txt === reduce.pl bib === id = 26560 author = Tolman, Albert Walter title = Jim Spurling, Fisherman or Making Good date = pages = extension = .txt mime = text/plain words = 72348 sentences = 6778 flesch = 94 summary = "Got your letter last night, Jim," said he, "and I can tell you it took Captain Nemo, towing behind Spurling on his leash, got in Percy's way, Percy got the lower near the door, with Budge over him; while Spurling and lose our way," said Jim. The remainder of the morning was spent in fitting up the lobster-traps With a wry face Jim held the thing up for Percy's Percy had kept the _Barracouta_ near by as Jim pulled the dory along "Guess I've told you all I know, and more, too," said Jim. They were back in Sprowl's Cove at half past ten, and put their lobsters "Percy," said Jim as the sloop rolled rhythmically on the long Atlantic "Look at the pirate!" said Jim. Grasping a ganging well above the hook, he held the fish up for Percy's "Let me spell you at the oars, Jim," said Percy. cache = ./cache/26560.txt txt = ./txt/26560.txt Building ./etc/reader.txt 15035 26632 43856 15035 43856 17171 number of items: 6 sum of words: 232,197 average size in words: 46,439 average readability score: 84 nouns: fish; miles; ground; water; fathoms; fishing; bottom; time; cod; depths; night; salmon; hand; way; man; boys; island; year; summer; boat; part; feet; spring; sea; fishermen; captain; end; mile; line; day; grounds; nets; launch; work; years; side; hake; shore; salt; cabin; place; head; season; men; shoal; thing; winter; haddock; bank; dollars verbs: is; was; are; had; be; have; were; ''s; do; said; been; get; has; got; make; taken; made; ''ve; take; found; came; come; did; going; go; being; see; let; know; want; lies; done; caught; say; keep; ''re; think; took; give; went; put; ''m; lay; find; began; run; tell; brought; set; guess adjectives: good; little; other; small; few; long; more; last; first; great; many; old; same; much; large; best; rocky; wide; hard; such; white; distant; right; big; deep; western; abundant; own; most; next; better; short; eastern; present; bad; new; black; several; fresh; sharp; open; strong; only; muddy; low; high; heavy; young; full; considerable adverbs: not; n''t; up; out; here; so; now; as; about; then; down; back; only; just; off; soon; too; very; all; in; there; more; also; away; again; on; over; well; long; almost; even; far; still; right; much; around; never; once; most; enough; perhaps; together; first; always; nearly; ever; pretty; aboard; better; later pronouns: it; he; i; his; they; you; we; their; them; him; its; our; us; me; your; her; my; she; himself; ''em; ''s; themselves; myself; one; yourself; em; itself; ourselves; yours; ours; mine; theirs; sho; d''you; oneself; one''ll; herself; fisherman''d proper nouns: _; percy; charley; jim; island; ground; walter; bank; s.; virginia; captain; chris; cape; june; mr.; whittington; spurling; hunter; bay; e.; filippo; july; new; lane; shoal; ridge; maine; barracouta; budge; may; se; head; sw; rock; april; westfield; throppy; ledge; gulf; bill; march; john; roberts; w.; ne; light; tarpaulin; england; point; monhegan keywords: june; island; good; fish; whittington; westfield; water; washington; walter; virginia; united; throppy; tarpaulin; stevens; states; spurling; smith; salmon; roberts; ridge; percy; penobscot; new; mr.; matinicus; massa; london; little; lane; john; jim; jabe; indians; hunter; ground; george; filippo; england; dolph; daniels; company; commission; come; colony; clearwater; chris; chesapeake; chas; charley; chapter one topic; one dimension: fish file(s): ./cache/17171.txt titles(s): New England Salmon Hatcheries and Salmon Fisheries in the Late 19th Century three topics; one dimension: fish; percy; ground file(s): ./cache/43856.txt, ./cache/26560.txt, ./cache/15035.txt titles(s): The Boy Chums Cruising in Florida Waters or, The Perils and Dangers of the Fishing Fleet | Jim Spurling, Fisherman or Making Good | Fishing Grounds of the Gulf of Maine five topics; three dimensions: fish charley said; percy jim ll; ground miles fathoms; salmon fish water; html resulted extreme file(s): ./cache/43856.txt, ./cache/26560.txt, ./cache/15035.txt, ./cache/17171.txt, titles(s): The Boy Chums Cruising in Florida Waters or, The Perils and Dangers of the Fishing Fleet | Jim Spurling, Fisherman or Making Good | Fishing Grounds of the Gulf of Maine | New England Salmon Hatcheries and Salmon Fisheries in the Late 19th Century | All Afloat: A Chronicle of Craft and Waterways Type: gutenberg title: subject-fisheries-gutenberg date: 2021-06-06 time: 15:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Fisheries" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 43856 author: Ely, Wilmer M. (Wilmer Mateo) title: The Boy Chums Cruising in Florida Waters or, The Perils and Dangers of the Fishing Fleet date: words: 67541.0 sentences: 5054.0 pages: flesch: 92.0 cache: ./cache/43856.txt txt: ./txt/43856.txt summary: "Both my chum and I would like to learn how to run the engine," Charley "Something that will not bear the light of day, I guess," said Charley, "Now look here, Hunter," Charley said coolly, "you fellows objected to "I never mentioned last night," said Charley, quickly, and Hunter Walter and the captain hurried to Charley and helped him up from the start out for real work," said Charley, cheerfully, ignoring his chum''s Well, Charley and the captain would never want him to fish with them way home," Charley said as soon as he got the engine started. It still lacked an hour to time to go fishing and Charley lay down on He and Walter changed places, and while Charley picked out the fish "I believe the wind is going down a little," Charley said, shortly Followed by the captain and Chris, Charley headed for the little id: 15035 author: Rich, Walter H. (Walter Herbert) title: Fishing Grounds of the Gulf of Maine date: words: 44544.0 sentences: 3289.0 pages: flesch: 88.0 cache: ./cache/15035.txt txt: ./txt/15035.txt summary: fishes--the cod, haddock, cusk, hake, pollock, and halibut--and each western shore of Nova Scotia is virtually all fishing ground for cod, fathoms on the shoal ground running from 5 miles from Gull Rock and the Rips furnish good cod and haddock fishing for the entire year, with hake ground from this point south to the Lurcher Shoal furnishes good fishing Island is all good ground in summer for cod and for pollock, also, when Principally Maine vessels fish this ground, using hand line and trawl. comparatively small ground, but it furnishes good cod fishing in the This is a cod and haddock ground at seasons when these fish are in in spring and fall and a haddock ground in winter and is fished by Principally a summer small-boat ground fished by hand lines, trawls, and Pollock Hub 3 miles) is a fishing ground for haddock in January and id: 26560 author: Tolman, Albert Walter title: Jim Spurling, Fisherman or Making Good date: words: 72348.0 sentences: 6778.0 pages: flesch: 94.0 cache: ./cache/26560.txt txt: ./txt/26560.txt summary: "Got your letter last night, Jim," said he, "and I can tell you it took Captain Nemo, towing behind Spurling on his leash, got in Percy''s way, Percy got the lower near the door, with Budge over him; while Spurling and lose our way," said Jim. The remainder of the morning was spent in fitting up the lobster-traps With a wry face Jim held the thing up for Percy''s Percy had kept the _Barracouta_ near by as Jim pulled the dory along "Guess I''ve told you all I know, and more, too," said Jim. They were back in Sprowl''s Cove at half past ten, and put their lobsters "Percy," said Jim as the sloop rolled rhythmically on the long Atlantic "Look at the pirate!" said Jim. Grasping a ganging well above the hook, he held the fish up for Percy''s "Let me spell you at the oars, Jim," said Percy. id: 17171 author: Various title: New England Salmon Hatcheries and Salmon Fisheries in the Late 19th Century date: words: 17306.0 sentences: 850.0 pages: flesch: 72.0 cache: ./cache/17171.txt txt: ./txt/17171.txt summary: of the fishery this year is the great numbers of young salmon caught, S.--Kennebec salmon caught to-day in the Hudson River at Bath near much better food-fish than the salmon. years the number of salmon has largely increased, due mainly, no doubt, salmon fisheries in the following rivers, namely, the Penobscot, the inclosure made in Dead Brook, and a stock of breeding salmon placed the salmon at the ordinary fishing season, May, June, and July, and keep The salmon placed in this inclosure had to be carted in tanks of water salmon, after it has left the fresh-water rivers in which it spawns and fish are taken throughout the entire pound-net season, but are most region, a great many salmon were being taken in the pound nets. In 1893, 3 fish were taken, as follows: May 10, a salmon weighing 19 1893, 2 salmon weighing 10 or 12 pounds each were taken at that place. id: 26632 author: Wharton, James title: The Bounty of the Chesapeake: Fishing in Colonial Virginia date: words: 30458.0 sentences: 1628.0 pages: flesch: 76.0 cache: ./cache/26632.txt txt: ./txt/26632.txt summary: _The Bounty of the Chesapeake; Fishing in Colonial Virginia._ By small rivers all the year there is a good plenty of small fish, so James river, the best waters for sturgeon in Virginia to this day. fish named by Colonial reporters are to be found in Virginia waters There are many more varieties of fish caught by Virginia fishermen expect at time of year to have a good fishing for cod, as both at of salt, fish, and profits of the land shall be for the tenants, conveyed quantities of salt fish to the Colony from Canada on his ship Colony in Virginia and that fish is worth not less than £600. time for fishing, that the salt or pickle would not keep them as in remain today among Virginia''s most plentiful fish but the salting The fishermen of Virginia needed salt for their fish as _The Fish and Fisheries of Colonial Virginia._ In id: 24808 author: Wood, William title: All Afloat: A Chronicle of Craft and Waterways date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel