mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file
Creating study carrel named subject-worldWideWeb-gutenberg
Initializing database
Unzipping
Archive: input-file.zip
creating: ./tmp/input/input-file/
inflating: ./tmp/input/input-file/303.txt
inflating: ./tmp/input/input-file/4742.txt
inflating: ./tmp/input/input-file/33375.txt
inflating: ./tmp/input/input-file/33374.txt
inflating: ./tmp/input/input-file/metadata.csv
caution: excluded filename not matched: *MACOSX*
=== DIRECTORIES: ./tmp/input
=== DIRECTORY: ./tmp/input/input-file
=== metadata file: ./tmp/input/input-file/metadata.csv
=== found metadata file
=== updating bibliographic database
Building study carrel named subject-worldWideWeb-gutenberg
FILE: cache/303.txt
OUTPUT: txt/303.txt
FILE: cache/33375.txt
OUTPUT: txt/33375.txt
FILE: cache/33374.txt
OUTPUT: txt/33374.txt
FILE: cache/4742.txt
OUTPUT: txt/4742.txt
303 txt/../wrd/303.wrd
303 txt/../ent/303.ent
33374 txt/../ent/33374.ent
33374 txt/../pos/33374.pos
33375 txt/../ent/33375.ent
303 txt/../pos/303.pos
4742 txt/../ent/4742.ent
33375 txt/../pos/33375.pos
4742 txt/../wrd/4742.wrd
33375 txt/../wrd/33375.wrd
4742 txt/../pos/4742.pos
33374 txt/../wrd/33374.wrd
=== file2bib.sh ===
id: 33375
author: Sawyer, Robert J.
title: Watch (First 25,000 words)
date:
pages:
extension: .txt
txt: ./txt/33375.txt
cache: ./cache/33375.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 1
resourceName b'33375.txt'
=== file2bib.sh ===
id: 303
author: Anonymous
title: HomeBrew HomePages Put YOU on the World Wide Web
date:
pages:
extension: .txt
txt: ./txt/303.txt
cache: ./cache/303.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 2
resourceName b'303.txt'
=== file2bib.sh ===
id: 33374
author: Sawyer, Robert J.
title: Wake (First 25,000 words)
date:
pages:
extension: .txt
txt: ./txt/33374.txt
cache: ./cache/33374.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 2
resourceName b'33374.txt'
=== file2bib.sh ===
id: 4742
author: Vaknin, Samuel
title: TrendSiters Digital Content and Web Technologies
date:
pages:
extension: .txt
txt: ./txt/4742.txt
cache: ./cache/4742.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 2
resourceName b'4742.txt'
Done mapping.
Reducing subject-worldWideWeb-gutenberg
=== reduce.pl bib ===
id = 4742
author = Vaknin, Samuel
title = TrendSiters Digital Content and Web Technologies
date =
pages =
extension = .txt
mime = text/plain
words = 30
sentences = 3
flesch = 86
summary = Copyright (C) 2007 by Lidija Rangelovska. Please see the corresponding RTF file for this eBook. RTF is Rich Text Format, and is readable in nearly any modern word processing program.
cache = ./cache/4742.txt
txt = ./txt/4742.txt
=== reduce.pl bib ===
id = 303
author = Anonymous
title = HomeBrew HomePages Put YOU on the World Wide Web
date =
pages =
extension = .txt
mime = text/plain
words = 48
sentences = 4
flesch = 94
summary = HomeBrew HomePages Put YOU on the World Wide Web The zip file homeb10.zip should contain all the material necessary to make a Web Page (c)1995 This is a Shareware Web Page you can use to make other Web Pages with for your own use, please read all files!!!!!!!!!
cache = ./cache/303.txt
txt = ./txt/303.txt
=== reduce.pl bib ===
id = 33374
author = Sawyer, Robert J.
title = Wake (First 25,000 words)
date =
pages =
extension = .txt
mime = text/plain
words = 29
sentences = 3
flesch = 80
summary = Copyright (C) 2009 by SFWRITER.COM Inc. This eBook is available in RTF format, please see the accompanying files. Note that it is an extract only, provided by the author.
cache = ./cache/33374.txt
txt = ./txt/33374.txt
=== reduce.pl bib ===
id = 33375
author = Sawyer, Robert J.
title = Watch (First 25,000 words)
date =
pages =
extension = .txt
mime = text/plain
words = 30
sentences = 3
flesch = 86
summary = Copyright (C) 2010 by Robert J. Sawyer This eBook is available in RTF format, please see the accompanying files. Note that it is an extract only, provided by the author.
cache = ./cache/33375.txt
txt = ./txt/33375.txt
Building ./etc/reader.txt
4742 33375 33374 4742 33375 33374
number of items: 4
sum of words: 137
average size in words: 34
average readability score: 86
nouns: web; files; copyright; c; format; file; extract; author; zip; word; use; sfwriter.com; program; processing; material; homeb10.zip
verbs: is; see; provided; note; make; accompanying; use; read; put; corresponding; contain
adjectives: available; readable; own; other; necessary; modern
adverbs: only; nearly
pronouns: you; it; your
proper nouns: rtf; ebook; page; world; wide; web; text; shareware; sawyer; robert; rich; rangelovska; pages; lidija; j.; inc.; homepages; homebrew; format; c)1995
keywords: web; sawyer; rtf; inc.
one topic; one dimension: web
file(s): ./cache/4742.txt
titles(s): TrendSiters Digital Content and Web Technologies
three topics; one dimension: rtf; web; files
file(s): ./cache/4742.txt, ./cache/303.txt, ./cache/33374.txt
titles(s): TrendSiters Digital Content and Web Technologies | HomeBrew HomePages Put YOU on the World Wide Web | Wake (First 25,000 words)
five topics; three dimensions: files copyright format; web zip use; rtf file nearly; ebook copyright format; ebook copyright format
file(s): ./cache/33375.txt, ./cache/303.txt, ./cache/4742.txt, ./cache/33374.txt, ./cache/33374.txt
titles(s): Watch (First 25,000 words) | HomeBrew HomePages Put YOU on the World Wide Web | TrendSiters Digital Content and Web Technologies | Wake (First 25,000 words) | Wake (First 25,000 words)
Type: gutenberg
title: subject-worldWideWeb-gutenberg
date: 2021-06-10
time: 17:06
username: emorgan
patron: Eric Morgan
email: emorgan@nd.edu
input: facet_subject:"World Wide Web"
==== make-pages.sh htm files
==== make-pages.sh complex files
==== make-pages.sh named enities
==== making bibliographics
id: 303
author: Anonymous
title: HomeBrew HomePages Put YOU on the World Wide Web
date:
words: 48
sentences: 4
pages:
flesch: 94
cache: ./cache/303.txt
txt: ./txt/303.txt
summary: HomeBrew HomePages Put YOU on the World Wide Web The zip file homeb10.zip should contain all the material necessary to make a Web Page (c)1995 This is a Shareware Web Page you can use to make other Web Pages with for your own use, please read all files!!!!!!!!!
id: 33375
author: Sawyer, Robert J.
title: Watch (First 25,000 words)
date:
words: 30
sentences: 3
pages:
flesch: 86
cache: ./cache/33375.txt
txt: ./txt/33375.txt
summary: Copyright (C) 2010 by Robert J. Sawyer This eBook is available in RTF format, please see the accompanying files. Note that it is an extract only, provided by the author.
id: 33374
author: Sawyer, Robert J.
title: Wake (First 25,000 words)
date:
words: 29
sentences: 3
pages:
flesch: 80
cache: ./cache/33374.txt
txt: ./txt/33374.txt
summary: Copyright (C) 2009 by SFWRITER.COM Inc. This eBook is available in RTF format, please see the accompanying files. Note that it is an extract only, provided by the author.
id: 4742
author: Vaknin, Samuel
title: TrendSiters Digital Content and Web Technologies
date:
words: 30
sentences: 3
pages:
flesch: 86
cache: ./cache/4742.txt
txt: ./txt/4742.txt
summary: Copyright (C) 2007 by Lidija Rangelovska. Please see the corresponding RTF file for this eBook. RTF is Rich Text Format, and is readable in nearly any modern word processing program.
==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel
Error: near line 1: database is locked
Send options without primary recipient specified.
Usage: mailx -eiIUdEFntBDNHRVv~ -T FILE -u USER -h hops -r address -s SUBJECT -a FILE -q FILE -f FILE -A ACCOUNT -b USERS -c USERS -S OPTION users