IntroductionThere are many acronyms, however, in a given field some words will be used repeatedly to build different acronyms. I wanted to know what words participate the most in building different acronyms in the computer field. So I wrote this program to do so. The program will do the following: - Read a dictionary file (text).
- Analyze it.
- Determine how many times the same word is used in building acronyms.
- Produce the result as an HTML table.
Running The ProgramLook at the file r.bat to see how to run the jar file. The program was written using Java 1.4. You can get a sample dictionary file from: http://www.geocities.com/ikind_babel/babel/babel.htmlResults Running the program against the dictionary file defined above, I got the following output (manually formatted): | Letter | # of Words Starting with Letter | First Word (% repeated) | Second Word (% repeated) | Third Word (% repeated) | Fourth Word (% repeated) | Fifth Word (% repeated) | | A | 890 | Access(%13) | Application(%6) | Architecture(%5) | Advanced(%4) | Address(%4) |
|---|
| B | 338 | Block(%8) | Binary(%7) | Basic(%5) | Bus(%4) | Bit(%4) |
|---|
| C | 976 | Control(%9) | Computer(%7) | Code(%3) | Communications(%2) | Common(%2) |
|---|
| D | 932 | Data(%22) | Digital(%7) | Device(%6) | Disk(%3) | Display(%3) |
|---|
| E | 622 | Electronic(%7) | Exchange(%7) | Extended(%6) | Environment(%5) | End(%4) |
|---|
| F | 444 | File(%16) | Format(%11) | Foreign(%5) | Facility(%3) | Field(%3) |
|---|
| G | 223 | Graphics(%20) | Global(%8) | Group(%8) | General(%8) | Gateway(%4) |
|---|
| H | 161 | High(%22) | Host(%6) | Hardware(%5) | Hexadecimal(%4) | Head(%4) |
|---|
| I | 787 | Interface(%18) | Information(%14) | Internet(%10) | Integrated(%4) | Identification(%3) |
|---|
| J | 72 | Jump(%36) | Java(%26) | Job(%9) | Joint(%9) | Jet(%4) |
|---|
| K | 72 | KEYBoard(%31) | Kit(%19) | Key(%15) | Keying(%5) | Keyboard(%5) |
|---|
| L | 541 | Language(%22) | Link(%6) | Line(%5) | Logic(%4) | Local(%4) |
|---|
| M | 781 | Management(%13) | Memory(%9) | Manager(%4) | Message(%3) | Microsoft(%3) |
|---|
| N | 387 | Network(%39) | National(%8) | Number(%6) | Not(%6) | NetWare(%2) |
|---|
| O | 377 | Object(%11) | Open(%10) | Of(%10) | Operating(%8) | Output(%3) |
|---|
| P | 910 | Protocol(%14) | Processing(%4) | Program(%3) | Programming(%3) | Per(%3) |
|---|
| Q | 48 | Query(%27) | Quality(%20) | Quad(%10) | Queuing(%6) | Quattro(%4) |
|---|
| R | 542 | Remote(%7) | Register(%4) | Request(%4) | Resource(%3) | Random(%3) |
|---|
| S | 1279 | System(%20) | Service(%5) | Standard(%3) | Server(%3) | Systems(%3) |
|---|
| T | 550 | Technology(%7) | Time(%6) | Terminal(%6) | Table(%5) | Transfer(%4) |
|---|
| U | 214 | Unit(%26) | User(%19) | Universal(%14) | Unix(%4) | Uniform(%2) |
|---|
| V | 256 | Virtual(%29) | Video(%19) | Voice(%6) | Visual(%6) | Variable(%3) |
|---|
| W | 145 | Windows(%17) | Web(%11) | Word(%11) | Wide(%6) | Write(%5) |
|---|
| X | 5 | XML(%40) | Xerox(%40) | | | |
|---|
| Y | 14 | You(%35) | Year(%28) | Your(%14) | Yet(%14) | |
|---|
| Z | 19 | Zero(%63) | Zone(%10) | | | |
|---|
Downloads |
ď Abdul Habra, Mar 28, 2011 11:06 AM
ď Abdul Habra, Mar 28, 2011 11:06 AM
|