mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-australianPoetry-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/15524.txt inflating: ./tmp/input/input-file/16362.txt inflating: ./tmp/input/input-file/304.txt inflating: ./tmp/input/input-file/962.txt inflating: ./tmp/input/input-file/4730.txt inflating: ./tmp/input/input-file/1199.txt inflating: ./tmp/input/input-file/214.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-australianPoetry-gutenberg FILE: cache/15524.txt OUTPUT: txt/15524.txt FILE: cache/304.txt OUTPUT: txt/304.txt FILE: cache/214.txt OUTPUT: txt/214.txt FILE: cache/1199.txt OUTPUT: txt/1199.txt FILE: cache/4730.txt OUTPUT: txt/4730.txt FILE: cache/962.txt OUTPUT: txt/962.txt FILE: cache/16362.txt OUTPUT: txt/16362.txt === file2bib.sh === id: 1199 author: nan title: An Anthology of Australian Verse date: pages: extension: .txt txt: ./txt/1199.txt cache: ./cache/1199.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'1199.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 1199 txt/../wrd/1199.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 1199 txt/../ent/1199.ent 1199 txt/../pos/1199.pos 15524 txt/../pos/15524.pos 15524 txt/../wrd/15524.wrd 16362 txt/../pos/16362.pos 16362 txt/../wrd/16362.wrd 4730 txt/../pos/4730.pos 4730 txt/../wrd/4730.wrd 16362 txt/../ent/16362.ent 15524 txt/../ent/15524.ent 304 txt/../pos/304.pos === file2bib.sh === id: 15524 author: Dennis, C. J. (Clarence James) title: Digger Smith date: pages: extension: .txt txt: ./txt/15524.txt cache: ./cache/15524.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'15524.txt' 304 txt/../wrd/304.wrd 214 txt/../pos/214.pos 4730 txt/../ent/4730.ent === file2bib.sh === id: 16362 author: Dennis, C. J. (Clarence James) title: The Glugs of Gosh date: pages: extension: .txt txt: ./txt/16362.txt cache: ./cache/16362.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'16362.txt' 214 txt/../wrd/214.wrd 304 txt/../ent/304.ent 214 txt/../ent/214.ent === file2bib.sh === id: 4730 author: Dennis, C. J. (Clarence James) title: The Songs of a Sentimental Bloke date: pages: extension: .txt txt: ./txt/4730.txt cache: ./cache/4730.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 3 resourceName b'4730.txt' === file2bib.sh === id: 304 author: Paterson, A. B. (Andrew Barton) title: Rio Grande's Last Race, and Other Verses date: pages: extension: .txt txt: ./txt/304.txt cache: ./cache/304.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 3 resourceName b'304.txt' === file2bib.sh === id: 214 author: Lawson, Henry title: In the Days When the World Was Wide, and Other Verses date: pages: extension: .txt txt: ./txt/214.txt cache: ./cache/214.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'214.txt' 962 txt/../pos/962.pos 962 txt/../wrd/962.wrd 962 txt/../ent/962.ent === file2bib.sh === id: 962 author: Kendall, Henry title: The Poems of Henry Kendall With Biographical Note by Bertram Stevens date: pages: extension: .txt txt: ./txt/962.txt cache: ./cache/962.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 5 resourceName b'962.txt' Done mapping. Reducing subject-australianPoetry-gutenberg === reduce.pl bib === === reduce.pl bib === id = 214 author = Lawson, Henry title = In the Days When the World Was Wide, and Other Verses date = pages = extension = .txt mime = text/plain words = 29004 sentences = 2322 flesch = 97 summary = They raised new stars on the silent sea that filled their hearts with awe; When men were gallant and ships were good -roaming the wide world round. We fight like women, and feel as much; the thoughts of our hearts we guard; Till like a pallid river flow the faces in the street -The old year went, and the new returned, in the withering weeks of drought, He tramped away from the shanty there, when the days were long and hot, When a man is better away from home, and dead to the world, Out Back. All day long in the flies and heat the men of the outside track 'Twas a better land to live in, in the days o' long ago. Do you think the bush was better in the 'good old droving days', The ghost of the man that I might have been is gone from my heart to-day; cache = ./cache/214.txt txt = ./txt/214.txt === reduce.pl bib === id = 15524 author = Dennis, C. J. (Clarence James) title = Digger Smith date = pages = extension = .txt mime = text/plain words = 11934 sentences = 1728 flesch = 106 summary = Ole ways," she sez, "seems to 'ave changed their style, I 'ad me Queen be'ind?" Sez Begg, "Wot rot! Sez Missus Flood, "Jim's won a medal too While that ole mother told--Good Lord!" sez 'e But, up to now, I ain't 'eard none for Dad. Ole Flood, an' all 'is kind throughout the land, "Look 'ere," I sez, "you let me spell yeh, Dad. An' then 'e sez, "'Ave yeh fergot me, Bill?" "It ain't too bad," 'e sez, with 'is ole smile; My wife sez little things sometimes that nearly git me riled. "You 'ear a lot," sez little Digger Smith, She sez, 'I ain't 'eard talk so good Jim mightn't come back 'ome, yeh know. "'Ow would yeh like," I sez to 'im, an' stops. When Missus Flood sez, "Bill, _wot do you think_?" "Yeh done it, lad," sez Jim. "I'm thinkin' things," sez Digger Smith. cache = ./cache/15524.txt txt = ./txt/15524.txt === reduce.pl bib === id = 4730 author = Dennis, C. J. (Clarence James) title = The Songs of a Sentimental Bloke date = pages = extension = .txt mime = text/plain words = 15845 sentences = 2626 flesch = 104 summary = Fer, as the poit sez, me 'eart 'as got The pip wiv yearnin' fer--I dunno wot. Fer when I come ter think uv wot I been.... Fer when a bloke 'as come to know Doreen, Wot's jist plain stoush wiv us, right 'ere to-day, Sez 'e "I'll dope yeh, so they'll THINK yer dead." Then freedom ain't the thing fer wot 'e yearns. A lispin' maid, wiv 'air an' eyes like 'ers, Doreen she sez, "You'll 'ave to meet my Mar, "Young friend," 'e sez--an' tears wus in 'is eyes-I LIKES that pilot fer the things 'e said. Wiv my Doreen, an' now it's come to this! "You got a look," 'e sez, "like you could stay; "I got no time fer wasters, lad," sez 'e, "I got no time fer wasters, lad," sez 'e, "I got no time fer wasters, lad," sez 'e, Doreen, she sez 'e's got a poit's eyes; cache = ./cache/4730.txt txt = ./txt/4730.txt === reduce.pl bib === id = 16362 author = Dennis, C. J. (Clarence James) title = The Glugs of Gosh date = pages = extension = .txt mime = text/plain words = 14700 sentences = 1369 flesch = 101 summary = He's a Glug of the old Gosh school! "It's wrong!" said this Glug, whose name was Joi. Of the Glugs of Gosh and their great King Splosh, To trade with the Glugs came the Ogs to Gosh, Till every Glug in the land of Gosh Said Joi: "In Gosh there shall yet be one And he said, "There is much that a Glug should know; The Glugs climbed trees in the days of yore, Said the Glug called Joi, "This climbing trees For a Glug named Joi and a king called Splosh!" But every Glug, and great King Splosh And the Swanks were called to the great King Splosh, Said Sym, "I shall tinker, and still be a king." Said Sym: "Kind friends, and fellow Glugs; "I'm with Sir Stodge, 0 Glugs of Gosh! The Glugs still live in the land of Gosh, "Aw, don't be a Glug!" said the little red dog. cache = ./cache/16362.txt txt = ./txt/16362.txt === reduce.pl bib === id = 304 author = Paterson, A. B. (Andrew Barton) title = Rio Grande's Last Race, and Other Verses date = pages = extension = .txt mime = text/plain words = 24432 sentences = 2094 flesch = 97 summary = He turned away the good old horse that served him many days; Came up on deck like a dead man, paralysed body and brain; 'Twas Saltbush Bill, with his travelling sheep, was making his way to town; 'Twas Saltbush Bill, with his travelling sheep, was making his way to town; 'Steel spurs, of course?' said old Rooster Hall; 'Twas the horse thief, Andy Regan, that was hunted like a dog 'Twas the horse thief, Andy Regan, that was hunted like a dog And the way that he chanced on a fighting man to reckon with Saltbush Bill. Till the fighting man shot home his left on the ribs with a mighty clout, 'You led the trump,' the old man said They said their horse could jump like fun, and asked an amateur Men fight all shapes and sizes as the racing horses run, Men fight all shapes and sizes as the racing horses run, cache = ./cache/304.txt txt = ./txt/304.txt === reduce.pl bib === id = 962 author = Kendall, Henry title = The Poems of Henry Kendall With Biographical Note by Bertram Stevens date = pages = extension = .txt mime = text/plain words = 98280 sentences = 7806 flesch = 94 summary = Past long hillocks looking like to waves of ocean turned to stone; Like a dying echo roaming sadly round a far off hill. And I thought they bore a murmur like a voice from sleeping seas. Like to lone hearts weeping over loved ones they shall see no more; Fly, like wild hounds, at the darkness, crouching over sea and earth; Changes like to swift-winged shadows falling on a moony deep! And the lights like flowers shall blossom, in high Heaven's kindly bosom, Forests golden, mountains hoary--can he look and love like we? While Night is stealing round the land, like Time across my face; But touching the ways of her eyes are: she comes to my soul like a tune-Dreaming mem'ries fall like moonlight over silver sleeping seas. Whose love is like beautiful light on the sea. It passed like the breath of the night-wind away, cache = ./cache/962.txt txt = ./txt/962.txt Building ./etc/reader.txt 962 214 304 962 16362 214 number of items: 7 sum of words: 194,195 average size in words: 32,365 average readability score: 99 nouns: day; man; face; life; night; days; eyes; heart; sea; land; time; world; e; years; men; wind; light; song; things; way; love; feet; fire; rain; voice; hand; soul; place; friend; waters; sun; death; hills; head; name; thing; trees; home; mountain; rest; year; words; sez; horse; morning; wife; water; beauty; ways; thunder verbs: is; was; are; ''s; be; have; were; said; had; do; see; know; came; come; ''ve; has; go; been; let; did; made; ''m; say; think; take; sez; went; left; got; comes; m; seen; heard; hear; look; ai; saw; make; give; tell; done; find; took; ''re; knew; gone; set; shining; get; found adjectives: old; wild; little; sweet; dead; many; great; good; other; fair; last; white; green; strong; young; more; long; high; sad; full; deep; strange; dark; first; soft; red; grand; bright; fierce; weary; poor; mighty; black; best; blue; bitter; lonely; human; dear; own; new; beautiful; true; same; much; low; wide; like; cold; clear adverbs: not; so; then; n''t; never; down; up; now; out; away; here; there; far; ever; back; still; too; again; yet; as; long; well; only; all; very; on; just; more; off; once; in; ago; often; always; over; no; alone; by; soon; right; home; round; together; hard; rather; even; thus; first; quite; most pronouns: i; he; his; it; you; we; they; me; my; she; her; him; their; your; our; its; us; them; thy; ''em; thee; himself; mine; one; uv; yourself; meself; myself; oo; itself; yours; ye; thyself; ours; themselves; ourselves; yer; theirs; imself; herself; ''s; pelf; em; yeh''ll; thee--; hers; yerself; wife--; we''d; sat proper nouns: _; e; god; yeh; glug; twas; o''er; lord; glugs; doreen; gosh; bill; jim; smith; sym; jack; ere; ye; king; west; heaven; thou; kendall; wiv; sydney; swanks; father; ole; australia; love; fer; peter; lo; hath; digger; swank; flood; stodge; sir; wot; splosh; nevertire; jist; tis; south; thee; old; thy; star; bush keywords: like; man; god; day; smith; love; look; come; young; year; wot; wiv; wind; wild; west; twas; turn; thy; thee; sym; sydney; sweet; swanks; stodge; splosh; song; sir; shine; shanty; sez; sea; saltbush; rooster; rise; quog; poole; peter; paterson; past; pass; old; nevertire; lord; light; life; leave; king; kendall; jim; jack one topic; one dimension: like file(s): ./cache/15524.txt titles(s): Digger Smith three topics; one dimension: like; er; said file(s): ./cache/962.txt, ./cache/4730.txt, ./cache/304.txt titles(s): The Poems of Henry Kendall With Biographical Note by Bertram Stevens | The Songs of a Sentimental Bloke | Rio Grande''s Last Race, and Other Verses five topics; three dimensions: like face sea; er sez like; ll man like; said glug glugs; wholesome ordered meals file(s): ./cache/962.txt, ./cache/4730.txt, ./cache/304.txt, ./cache/16362.txt, titles(s): The Poems of Henry Kendall With Biographical Note by Bertram Stevens | The Songs of a Sentimental Bloke | Rio Grande''s Last Race, and Other Verses | The Glugs of Gosh | An Anthology of Australian Verse Type: gutenberg title: subject-australianPoetry-gutenberg date: 2021-06-01 time: 13:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Australian poetry" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 15524 author: Dennis, C. J. (Clarence James) title: Digger Smith date: words: 11934.0 sentences: 1728.0 pages: flesch: 106.0 cache: ./cache/15524.txt txt: ./txt/15524.txt summary: Ole ways," she sez, "seems to ''ave changed their style, I ''ad me Queen be''ind?" Sez Begg, "Wot rot! Sez Missus Flood, "Jim''s won a medal too While that ole mother told--Good Lord!" sez ''e But, up to now, I ain''t ''eard none for Dad. Ole Flood, an'' all ''is kind throughout the land, "Look ''ere," I sez, "you let me spell yeh, Dad. An'' then ''e sez, "''Ave yeh fergot me, Bill?" "It ain''t too bad," ''e sez, with ''is ole smile; My wife sez little things sometimes that nearly git me riled. "You ''ear a lot," sez little Digger Smith, She sez, ''I ain''t ''eard talk so good Jim mightn''t come back ''ome, yeh know. "''Ow would yeh like," I sez to ''im, an'' stops. When Missus Flood sez, "Bill, _wot do you think_?" "Yeh done it, lad," sez Jim. "I''m thinkin'' things," sez Digger Smith. id: 16362 author: Dennis, C. J. (Clarence James) title: The Glugs of Gosh date: words: 14700.0 sentences: 1369.0 pages: flesch: 101.0 cache: ./cache/16362.txt txt: ./txt/16362.txt summary: He''s a Glug of the old Gosh school! "It''s wrong!" said this Glug, whose name was Joi. Of the Glugs of Gosh and their great King Splosh, To trade with the Glugs came the Ogs to Gosh, Till every Glug in the land of Gosh Said Joi: "In Gosh there shall yet be one And he said, "There is much that a Glug should know; The Glugs climbed trees in the days of yore, Said the Glug called Joi, "This climbing trees For a Glug named Joi and a king called Splosh!" But every Glug, and great King Splosh And the Swanks were called to the great King Splosh, Said Sym, "I shall tinker, and still be a king." Said Sym: "Kind friends, and fellow Glugs; "I''m with Sir Stodge, 0 Glugs of Gosh! The Glugs still live in the land of Gosh, "Aw, don''t be a Glug!" said the little red dog. id: 4730 author: Dennis, C. J. (Clarence James) title: The Songs of a Sentimental Bloke date: words: 15845.0 sentences: 2626.0 pages: flesch: 104.0 cache: ./cache/4730.txt txt: ./txt/4730.txt summary: Fer, as the poit sez, me ''eart ''as got The pip wiv yearnin'' fer--I dunno wot. Fer when I come ter think uv wot I been.... Fer when a bloke ''as come to know Doreen, Wot''s jist plain stoush wiv us, right ''ere to-day, Sez ''e "I''ll dope yeh, so they''ll THINK yer dead." Then freedom ain''t the thing fer wot ''e yearns. A lispin'' maid, wiv ''air an'' eyes like ''ers, Doreen she sez, "You''ll ''ave to meet my Mar, "Young friend," ''e sez--an'' tears wus in ''is eyes-I LIKES that pilot fer the things ''e said. Wiv my Doreen, an'' now it''s come to this! "You got a look," ''e sez, "like you could stay; "I got no time fer wasters, lad," sez ''e, "I got no time fer wasters, lad," sez ''e, "I got no time fer wasters, lad," sez ''e, Doreen, she sez ''e''s got a poit''s eyes; id: 962 author: Kendall, Henry title: The Poems of Henry Kendall With Biographical Note by Bertram Stevens date: words: 98280.0 sentences: 7806.0 pages: flesch: 94.0 cache: ./cache/962.txt txt: ./txt/962.txt summary: Past long hillocks looking like to waves of ocean turned to stone; Like a dying echo roaming sadly round a far off hill. And I thought they bore a murmur like a voice from sleeping seas. Like to lone hearts weeping over loved ones they shall see no more; Fly, like wild hounds, at the darkness, crouching over sea and earth; Changes like to swift-winged shadows falling on a moony deep! And the lights like flowers shall blossom, in high Heaven''s kindly bosom, Forests golden, mountains hoary--can he look and love like we? While Night is stealing round the land, like Time across my face; But touching the ways of her eyes are: she comes to my soul like a tune-Dreaming mem''ries fall like moonlight over silver sleeping seas. Whose love is like beautiful light on the sea. It passed like the breath of the night-wind away, id: 214 author: Lawson, Henry title: In the Days When the World Was Wide, and Other Verses date: words: 29004.0 sentences: 2322.0 pages: flesch: 97.0 cache: ./cache/214.txt txt: ./txt/214.txt summary: They raised new stars on the silent sea that filled their hearts with awe; When men were gallant and ships were good -roaming the wide world round. We fight like women, and feel as much; the thoughts of our hearts we guard; Till like a pallid river flow the faces in the street -The old year went, and the new returned, in the withering weeks of drought, He tramped away from the shanty there, when the days were long and hot, When a man is better away from home, and dead to the world, Out Back. All day long in the flies and heat the men of the outside track ''Twas a better land to live in, in the days o'' long ago. Do you think the bush was better in the ''good old droving days'', The ghost of the man that I might have been is gone from my heart to-day; id: 304 author: Paterson, A. B. (Andrew Barton) title: Rio Grande''s Last Race, and Other Verses date: words: 24432.0 sentences: 2094.0 pages: flesch: 97.0 cache: ./cache/304.txt txt: ./txt/304.txt summary: He turned away the good old horse that served him many days; Came up on deck like a dead man, paralysed body and brain; ''Twas Saltbush Bill, with his travelling sheep, was making his way to town; ''Twas Saltbush Bill, with his travelling sheep, was making his way to town; ''Steel spurs, of course?'' said old Rooster Hall; ''Twas the horse thief, Andy Regan, that was hunted like a dog ''Twas the horse thief, Andy Regan, that was hunted like a dog And the way that he chanced on a fighting man to reckon with Saltbush Bill. Till the fighting man shot home his left on the ribs with a mighty clout, ''You led the trump,'' the old man said They said their horse could jump like fun, and asked an amateur Men fight all shapes and sizes as the racing horses run, Men fight all shapes and sizes as the racing horses run, id: 1199 author: nan title: An Anthology of Australian Verse date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel