Talk About Network

Google


Register and Login
Nick
Password
Register create new account Sign up is FREE and you can post replies, new topics, bookmark posts and more!
Recover lost password


Programming > Java Machine > Re: Keyword ext...
Latest [ Topics | Posts ] Archive Post A New Topic Post a Reply
<< Topic < Post Post 2 of 9 Topic 745 of 843
Post > Topic >>

Re: Keyword extractor's source code....where I can find it???

by glen herrmannsfeldt <gah@[EMAIL PROTECTED] > Jan 11, 2007 at 05:57 PM

giugy wrote:

> Someone knows where I can find the Keyword Extractor source code
> written in java? A software that analyzes a text and extract the
> keyword of the text (the most present words in the text....for example
> the word "hello" is present forty times,the word "thanks" is present
> thirty times....).

> I need to see the software's source code written in java in order to
> understand as it works....

It is very easy to write in Java.

First read a line and extract words using StringTokenizer.  Then
use a Hashtable to find out if you have seen that word before.
If so, increment a counter.  If not, add it to the Hashtable with
a count of 1.   I store a long[] in the hashtable for convenience
in incrementing, but others will do something different.

One trick, though.  After you extract words with StringTokenizer and
find they are not in the table, create a new String to store the
reference in the hash table.  If you don't it will take up too much
memory, as the whole line of characters is stored for each word.

After you finish reading the file, go through the Hashtable,
extract words and counts, and print them out.

It should not take long at all to write.

-- glen
 




 9 Posts in Topic:
Keyword extractor's source code....where I can find it???
"giugy" <mat  2007-01-11 07:59:17 
Re: Keyword extractor's source code....where I can find it???
glen herrmannsfeldt <g  2007-01-11 17:57:33 
Re: Keyword extractor's source code....where I can find it???
"giugy" <mat  2007-01-16 09:20:23 
Re: Keyword extractor's source code....where I can find it???
glen herrmannsfeldt <g  2007-01-16 23:15:15 
Re: Keyword extractor's source code....where I can find it???
"giugy" <mat  2007-01-16 09:20:25 
Re: Keyword extractor's source code....where I can find it???
"giugy" <mat  2007-01-17 01:15:50 
Re: Keyword extractor's source code....where I can find it???
glen herrmannsfeldt <g  2007-01-17 01:25:31 
Re: Keyword extractor's source code....where I can find it???
"giugy" <mat  2007-01-17 01:15:51 
Re: Keyword extractor's source code....where I can find it???
"giugy" <mat  2007-01-17 01:15:50 

Post A Reply:
  Go here to Signup

AddThis Feed Button


About - Advertising - Contact - Frequently Asked Questions - Privacy Policy - Terms of Use - Signup

Contact
tan12V112 Wed Dec 3 14:33:40 CST 2008.