Developing a Computational Word Learner
The CHILDES database: http://childes.psy.cmu.edu/
The TalkBank database: http://www.talkbank.org/
The WordNet database: http://wordnet.princeton.edu/
Annotated data for social cues by Michael Frank: corpus
You can find a prototype here:
Prototype
To run the code (JVM options such as -Xmx512m, which sets the maximum heap size to 512 MB, must come before -jar):
java -Xmx512m -jar LearningMeaningsXXX.jar
And some documentation: Documentation