![]() |
| Home > Science > ai-faq > general > |
Artificial Intelligence FAQ:5/6 AI Web Directories & Online Papers [Monthly posting] |
Section 2 of 2 - Prev - Next
ftp -- ftp.mrg.dist.unige.it:/pub/mrg-ftp
email -- Fausto Giunchiglia
Mechanized Reasoning Group, IRST
38050 Povo Trento, Italy
Tel: +39 461-314444 (secr.)
+39 461-314436 (office)
Fax: +39 461-302040 / 314591
National University of Singapore:
ftp -- ftp.nus.sg:/pub/NUS/ISCS/techreports
New York University (NYU):
ftp -- cs.nyu.edu:/pub/tech-reports
OGI:
ftp -- cse.ogi.edu:/pub/tech-reports
email -- csedept@cse.ogi.edu
Ohio State University, Laboratory for AI Research
ftp -- nervous.cis.ohio-state.edu:/pub/papers
email -- lair-librarian@cis.ohio-state.edu
OSU Neuroprose:
ftp -- archive.cis.ohio-state.edu:/pub/neuroprose (128.146.8.52)
This directory contains technical reports as a public service to the
connectionist and neural network scientific community which has an
organized mailing list (for info: connectionists-request@cs.cmu.edu)
Includes several bibliographies.
Stanford:
ftp -- elib.stanford.edu:/cs
Very spotty collection.
SRI:
email -- Donna O'Neal, donna@ai.sri.com
SUNY Buffalo:
ftp -- ftp.cs.buffalo.edu:/pub/tech-reports/
SUNY at Stony Brook:
ftp -- sbcs.sunysb.edu:/pub/TechReports
email -- rick@cs.sunysb.edu or stark@cs.sunysb.edu
The /pub/sunysb directory contains the SB-Prolog implementation
of the Prolog language. Contact warren@sbcs.sunysb.edu for more
information.
TCGA (The Clearinghouse for Genetic Algorithms):
email -- Robert Elliott Smith
Department of Engineering of Mechanics
Room 210 Hardaway Hall
The University of Alabama
PO Box 870278
Tuscaloosa, AL 35487
205-348-1618, fax 205-348-6419
Thinking Machines:
ftp -- ftp.think.com:/think/techreport.list
This file contains a list of Thinking Machines technical reports.
Orders may be placed by email (limit 5) to t-rex@think.com, or by US
Mail to Thinking Machines Corporation, Attn: Technical reports, 245
First Street, Cambridge, MA 01241. In addition, the directories
cm/starlisp and cm/starlogo contain code for the *Lisp and *Logo
simulators.
Tulane University:
ftp -- rex.cs.tulane.edu:/pub/tech/ [129.81.132.1]
University of Alabama:
ftp -- aramis.cs.ua.edu:/pub/tech-reports/
University of Arizona:
ftp -- cs.arizona.edu:/reports/
email -- tr_libr@cs.arizona.edu
The directory /japan/kahaner.reports contains reports on AI in
Japan, among other things, written by Dr. David Kahaner, a
numerical analyst on sabbatical to the Office of Naval
Research-Asia (ONR Asia) in Tokyo from NIST. The reports are not
written in any sort of official capacity, but are quite interesting.
University of California/Los Angeles:
ftp -- ftp.cs.ucla.edu:/tech-report/
University of California/Santa Cruz:
ftp -- ftp.cse.ucsc.edu:/pub/bib/
ftp.cse.ucsc.edu:/pub/tr/
email -- jean@cs.ucsc.edu
University of Cambridge Computer Lab:
email -- tech-reports@cl.cam.ac.uk
University of Colorado:
ftp -- ftp.cs.colorado.edu:/pub/cs/techreports
University of Florida:
ftp -- bikini.cis.ufl.edu:/cis/tech-reports
University of Genoa, Mechanized Reasoning Group:
ftp -- ftp.mrg.dist.unige.it:/pub/mrg-ftp/
email -- Fausto Giunchiglia
University of Georgia:
ftp -- ai.uga.edu:/pub/ai.reports/
University of Illinois at Urbana:
ftp -- a.cs.uiuc.edu:/pub/dcs
email -- e-amerman@a.cs.uiuc.edu
University of Indiana, Center for Research on Concepts and Cognition:
ftp -- cogsci.indiana.edu:/pub/
email -- helga@cogsci.indiana.edu
University of Kaiserslautern, Germany:
ftp -- ftp.uni-kl.de:/reports_uni-kl/computer_science/
University of Kentucky:
ftp -- ftp.ms.uky.edu:/pub/tech-reports/UK/cs/
University of Massachusetts at Amherst:
email -- techrept@cs.umass.edu
University of Melbourne, Australia,
Computer Vision and Pattern Recognition Laboratory (CVPRL):
ftp -- krang.vis.mu.oz.au:/pub/articles
University of Michigan:
ftp -- ftp.eecs.umich.edu:/techreports
University of North Carolina:
ftp -- ftp.cs.unc.edu:/pub/technical-reports/
University of Pennsylvania:
ftp -- ftp.cis.upenn.edu:/pub/papers/
email -- publications@upenn.edu [email bounced 7/7/93]
USC/Information Sciences Institute:
email -- Sheila Coyazo is the contact. [email
bounced 7/7/93]
University of Toronto:
ftp -- ftp.cs.toronto.edu:/pub/cogrob/ (Cognitive Robotics)
ftp.cs.toronto.edu:/pub/reports/
email -- tech-reports@cs.toronto.edu
University of Virginia:
ftp -- uvacs.cs.virginia.edu:/pub/techreports/cs
University of Western Australia:
ftp -- ciips.ee.uwa.edu.au
Centre for Intelligent Information Processing Systems (CIIPS)
EE Engineering Department
University of Wisconsin:
ftp -- ftp.cs.wisc.edu:/tech-reports
ftp.cs.wisc.edu:/machine-learning
ftp.cs.wisc.edu:/computer-vision
email -- tech-reports-archive@cs.wisc.edu
Some AI authors have set up repositories of their own papers:
Matthew Ginsberg: t.stanford.edu:/u/ftp/papers
----------------------------------------------------------------
Subject: [5-6] Technical resources for/by undergraduate students
Brainsciences http://www.brainsciences.com
A group of students at Brown University have created a web site to
"provide a forum for undergraduates to publish their work. We feature
reports of original research, book reviews, term papers, and other work
in a similar vein."
----------------------------------------------------------------
Subject: [5-7] Where can I get a machine readable dictionary, thesaurus, and
other text corpora?
Linguistic Data Consortium:
The Linguistic Data Consortium was established to broaden the
collection and distribution of speech and natural language data
bases for the purposes of research and technology development in
automatic speech recognition, natural language processing, and
other areas where large amounts of linguistic data are needed. LDC
corpora are the most commonly used in published research.
Information about the LDC is at http://www.ldc.upenn.edu/
Free:
On unix systems, /usr/dict/words is a fine word list.
===========
The Moby Thesaurus (25,000 roots/1.2 million synonyms), Moby Words
(560,000 entries), Moby Hyphenator (155,000 entries), and the Moby
Part-of-Speech (214,000 entries), Moby Pronunciator (167,000
entries with IPA encoding, syllabification, and primary, secondary,
and tertiary stress marks) and Moby Language (100,000 word word
lists in five major world languages) lexical databases are
available at:
http://www.dcs.shef.ac.uk/research/ilash/Moby/
This was once commercial but is now in the public domain.
[thanks to Robert Bechtel]
===========
Roget's 1911 Thesaurus is available by anonymous FTP from the
Consortium for Lexical Research
clr.nmsu.edu:/CLR/lexica/roget-1911 [128.123.1.12]
It is also available from
src.doc.ic.ac.uk:/literary/collections/project_gutenberg/roget11.txt.Z
An old Webster's dictionary is in /text/dict/{DICT.Z,DICT.INDEX.Z}.
Project Gutenberg also has Roget's 1911 Thesaurus. The Project
Gutenberg archive is at mrcnext.cso.uiuc.edu:/pub/etext/. The
Project Gutenberg archive collects public domain electronic books. For more
information, write to Michael S. Hart, Professor of Electronic Text,
Executive Director of Project Gutenberg Etext, Illinois Benedictine
College, 5700 College Road, Lisle, IL 60532 or send email to
hart@vmd.cso.uiuc.edu.
The Online Book Initiative maintains a text repository at
http://obi.std.com:/obi/
The CHILDES project at Carnegie Mellon University has a lot of data of
children speaking to adults, as well as the adult written and adult
spoken corpora from the CORNELL project. Contact Brian MacWhinney
for more information.
The Association for Computational Linguistics (ACL) has a Data
Collection Initiative. For more information, contact Donald Walker at
Bellcore, walker@flash.bellcore.com.
Two lists of common female first names (4967 names) and male first
names (2924 names) are available for anonymous ftp from
ftp.cs.cmu.edu:/user/ai/areas/nlp/corpora/names/
Read the file README first. Send mail to mkant@cs.cmu.edu for more
information.
A list of 110,000 English words (one per line, in ASCII) is
available in the PD1: directory on SIMTEL20 as the
files WORDS1.ZIP, WORDS2.ZIP, WORDS3.ZIP, and WORDS4.ZIP. Although the
list is in MS-DOS files, it can easily be used on other machines (but
first you'll have to unzip the files on a DOS machine). The list
includes inflected forms of the words, such as plural nouns and the
-s, -ed, and -ing forms of verbs; thus the number of lexical stems in
the list is considerably smaller than the total number of word forms.
These files are available via FTP from WSMR-SIMTEL20.ARMY.MIL
[192.88.110.20]. SIMTEL20 files are mirrored on wuarchive.wustl.edu.
The Collins English Dictionary encoded as a Prolog fact base is
available from the Oxford Text Archive by anonymous ftp from
ota.ox.ac.uk:/pub/ota/dicts/1192/ [129.67.1.165]
The Oxford Text Archive includes many other texts, dictionaries,
thesauri, word lists, and so on, most of which are available for
scholarly use and research only. See the files
ota.ox.ac.uk:/pub/ota/textarchive.form
ota.ox.ac.uk:/pub/ota/textarchive.info
ota.ox.ac.uk:/pub/ota/textarchive.list
ota.ox.ac.uk:/pub/ota/textarchive.sgml
for more information, or write to archive@ox.ac.uk, Oxford Text Archive,
Oxford University Computing Services, 13 Banbury Road, Oxford OX2
6NN, UK, call 44-865-273238 or fax 44-865-273275.
Chuck Wooters has extracted the most
likely pronunciation for each of about 6100 words in the hand-labeled
TIMIT database, and made them available by anonymous ftp from
ftp.icsi.berkeley.edu:/pub/speech/TIMIT.mostlikely.Z.
A list of homophones from general American English is available by
anonymous ftp from svr-ftp.eng.cam.ac.uk:/comp.speech/data/ as the file
homophones-1.01.txt. To receive the list by email, send mail to
Evan.Antworth@sil.org. The list was compiled by Tony Robinson.
Sigurd P. Crossland has been compiling
a dictionary of English words, including most common American words,
abbreviations, hyphenations, and even incorrect spellings. The most
recent version is available by anonymous ftp from
wocket.vantage.gte.com:/pub/standard_dictionary/dic-0394.tar.gz
The tar file includes 31 text files, one for each word-length from 2
to 32. The compressed tar file takes up just over 4mb of space, and
includes approximately 870,000 words.
WordNet is an English lexical reference system based on current
psycholinguistic theories of human lexical memory. It organizes nouns,
verbs and adjectives into synonym sets corresponding to lexical
concepts. The sets are linked by a variety of relations. Besides being
of scientific interest,
it makes a handy thesaurus. WordNet is available by anonymous ftp from
clarity.princeton.edu:/pub/
If you retrieve a copy of wordnet by ftp, please send mail to
wordnet@princeton.edu.
Commercial:
The Oxford Text Archive has hundreds of online texts in a wide variety
of languages, including a few dictionaries (the OED, Collins, etc.).
The Lancaster-Oslo-Bergen (LOB), Brown, and London-Lund corpii are also
available from them. For more information, write to Oxford Electronic
Publishing, Oxford University Press, 200 Madison Avenue, New York, NY
10016, call 212-889-0206, or send mail to archive@vax.oxford.ac.uk.
(Their contact information in England is Oxford Text Archive, Oxford
University Computing Service, 13 Banbury Road, Oxford OX2 6NN, UK, +44
(865) 273238.)
Mailing Lists:
CORPORA is a mailing list for Text Corpora. It welcomes information
and questions about text corpora such as availability, aspects of
compiling and using corpora, software, tagging, parsing, and
bibliography. To be added to the list, send a message to
corpora-request@x400.hd.uib.no. Contributions should be sent to
corpora@x400.hd.uib.no.
----------------------------------------------------------------
Subject: [5-8] Where can I get training sets for machine learning algorithms?
UC/Irvine (UCI) AI/Machine Learning Repository:
ftp.ics.uci.edu has a variety of AI-related materials, with a special
focus on machine learning. For example,
ftp.ics.uci.edu:/pub/machine-learning-databases/
contains over 80 benchmark data sets for classifier systems (30mb).
MLnet Machine Learning Archive
MLnet Online Information Service
In 1988 the Special Interest Group on Machine Learning of the German
Society for Computer Science (GI e.V.) decided to establish a library
of PROLOG implementations of Machine Learning algorithms. By 1994 the
library had a sizable collection of GLPed PROLOG software. The site
has grown, and now, according to the webpage it "offers a growing
collection of ML information, datasets, software and pointers to other
ML resources." The homepage is at:
http://www.mlnet.org
Send your contributions to Mathias Kirsten (info@mlnet.org) at the GMD -
German National Research Center, or use the contribution facilities within
the MLnet OiS.
----------------------------------------------------------------
Subject: [5-9] What on-line Journals are there?
[this question is still in progress]
Journal of Artificial Intelligence Research. See [3-2a].
Journal of Machine learning Reasearch. See [3-2n].
---
[ comp.ai is moderated. To submit, just post and be patient, or if ]
[ that fails mail your article to , and ]
[ ask youa news administrator to fix the problems with your system. ]
Section 2 of 2 - Prev - Next
| Back to category general - Use Smart Search |
| Home - Smart Search - About the project - Feedback |
© allanswers.org | Terms of use