Å·±¦ÓéÀÖ

Support for Indie Authors discussion

648 views
Author Resources-No New Posts > Word frequency tool

Comments Showing 1-50 of 110 (110 new)    post a comment »
« previous 1 3

message 1: by J.D. (new)

J.D. Kaplan | 47 comments So in my first book I discovered that once I latch onto a word that I liked I sometimes tended to overuse it. I had a hard time finding anything on the web to help me manage this situation so I wrote something myself and I'm making it available for others to use on my website.

It's available on my website at

It not only counts word frequency (which is configurable in terms of words to ignore and minimum count to see in results) it also counts what I call 'hotspots' which are places where a word shows up more than 3 times in close proximity.

This application is ENTIRELY written in javascript and has no contact with any server, so your content never leaves your machine. This means that no one is seeing your content, or the results of the analysis.

You can easily see the javascript via your web browser and capture it and have it evaluated to convince yourself I am being truthful--I would want to be extremely careful with my content as well.

While I encourage you to use it via my website, he entire thing is on an MIT license, meaning you can do what you want with it. I will be exposing the source code publicly on github in the near future. That link will be available on the page once I get it established. If you're a code monkey like me feel free to engage it as open source (fork and pull requests, issues, etc)

If you decide to host it yourself, I'd appreciate a mention and link to my website. If you make changes, I'd love to know what they are.

If you find issues, enhancement ideas or confusion please let me know via email at [email protected]


message 2: by Dwayne, Head of Lettuce (new)

Dwayne Fry | 4434 comments Mod
J.D. wrote: "So in my first book I discovered that once I latch onto a word that I liked I sometimes tended to overuse it. I had a hard time finding anything on the web to help me manage this situation so I wro..."

That sounds cool J.D. Can't wait to get home and give it a shot.


message 3: by Christina (new)

Christina McMullen (cmcmullen) That's cool! I am uber guilty of the hotspots and having characters who seem to be in a perpetual state of huffing and sighing. :)


message 4: by [deleted user] (new)

Mine are always frowning at something someone said. "He/she frowned." I usually clean all that out in the edits, though.


message 5: by J.D. (new)

J.D. Kaplan | 47 comments I have this sneaking suspicion that I could edit my manuscripts 100x and still find things I want to change every time through. The tool helps me to turn this into a diagnostic step rather than another opportunity to do a huge rewrite :)


message 6: by Charles (new)

Charles Hash | 1054 comments Ken wrote: "Mine are always frowning at something someone said. "He/she frowned." I usually clean all that out in the edits, though."

I used to play MUDs back in the 90s and have that problem too.


message 7: by A.E. (new)

A.E. Hellstorm (aehellstorm) | 196 comments Wow, this is awesome, JD. I think we all are guilty to have favorite darling words that we need to kill every once in a while. Mine are definitely smile/frown/look. *sigh* ;-)


message 8: by Dwayne, Head of Lettuce (new)

Dwayne Fry | 4434 comments Mod
A.E. wrote: "I think we all are guilty to have favorite darling words that we need to kill every once in a while. "

I'm curious what mine are. Can't wait to get home and find out. *glances at the clock* Ten hours down, eighteen to go...

I'm betting my darling words are "coddlepocks" and "swiggledown". I use them way too much. But, doesn't everyone?

No, like most everyone, I'm guessing it's like "smirked", "smiled", "frowned", "winced" and so on.


message 9: by Christina (new)

Christina McMullen (cmcmullen) Good gravy, Dwayne! Twenty eight hour shifts???


message 10: by Dwayne, Head of Lettuce (new)

Dwayne Fry | 4434 comments Mod
Christina wrote: "Good gravy, Dwayne! Twenty eight hour shifts???"

Yeah, luckily this wasn't one of those seventy-two hour shift weekends.


message 11: by J.D. (new)

J.D. Kaplan | 47 comments I found that all my characters were smiling WAY too much :) they're much more stoic now.


message 12: by Christina (new)

Christina McMullen (cmcmullen) Apparently mine grin and smirk. This could turn obsessive. :)


message 13: by A.E. (new)

A.E. Hellstorm (aehellstorm) | 196 comments Thesaurus.com is my best friend, no doubt about it. :-)


message 14: by Micah (new)

Micah Sisk (micahrsisk) | 1042 comments Hmm...It detects hotspots but gives no way to find where they are in the text, which makes the tool seem of limited use.

Most of my repeated words were the main character names and words like "had" "what" "that" and other common ones. Didn't notice any words that really bothered me.

But still, without knowing where the clusters happen, I'd find it a real slog to make corrections based on this.


message 15: by J.D. (new)

J.D. Kaplan | 47 comments noting the locations of the hotspots is something I've been working on. it's a tricky proposition as I'd really only be able to note it in terms of word counts. Without some understanding of manuscript structure that's about as fine grained as it can be.

Also, while at the end of my "process" I'll dump the whole manuscript in and see what it shows me, I also use it during revisions on single chapters at a time. Makes it a lot easier to find the offending sections.


message 16: by Charles (new)

Charles Hash | 1054 comments I used the f word like 70 times.


message 17: by Hayden (new)

Hayden Linder (haydendlinder) | 86 comments Charles wrote: "I used the f word like 70 times."

Seriously?! How many times can you describe something as "F"reudian?

Looking forward to giving this tool a spin. Thanks for creating this JD.


message 18: by Dawn (new)

Dawn Rezac | 2 comments J.D. wrote: "So in my first book I discovered that once I latch onto a word that I liked I sometimes tended to overuse it. I had a hard time finding anything on the web to help me manage this situation so I wro..."


message 19: by Dawn (new)

Dawn Rezac | 2 comments That sounds like an amazing tool and I would love to hear more about it. [email protected] I know my major hotspot is using she and her too much in a first person narrative. I had an editor but it didn't seem to bother them. I am still trying to include my book on Good Reads and may be a few days yet.


message 20: by Owen (new)

Owen O'Neill (owen_r_oneill) | 1509 comments Interesting. What's the word limit for this tool?


message 21: by J.D. (new)

J.D. Kaplan | 47 comments @Owen: it's configurable. I have it defaulting to 5 but when I do my entire manuscript at once I usually raise that a bit. You can also add words that you want to ignore (for example, if you know you use 'bombastic' a lot but have decided that's okay you can have the application not count it or display results.)


message 22: by Dwayne, Head of Lettuce (new)

Dwayne Fry | 4434 comments Mod
Charles wrote: "I used the f word like 70 times."

Why do you write about frogs so much?


message 23: by Micah (new)

Micah Sisk (micahrsisk) | 1042 comments Dwayne wrote: "Charles wrote: "I used the f word like 70 times."

Why do you write about frogs so much?"


Isn't Charles a French name?


message 24: by Micah (new)

Micah Sisk (micahrsisk) | 1042 comments J.D. wrote: "@Owen: it's configurable..."

I think Owen meant the word count of the text you put in. I put in about 75,000 words and it ran pretty quickly.


message 25: by Owen (new)

Owen O'Neill (owen_r_oneill) | 1509 comments J.D. wrote: "@Owen: it's configurable. I have it defaulting to 5 but when I do my entire manuscript at once I usually raise that a bit. You can also add words that you want to ignore (for example, if you know y..."

Micah is correct -- I meant word count on the text input. I was having trouble getting results for anything over 2-3000 words. It could be this machine. Firefox gets weird sometimes.


message 26: by Micah (new)

Micah Sisk (micahrsisk) | 1042 comments New novel...

45 f**k
14 f**ked
39 f**king
48 sh*t
8 bullsh*t
13 Jesus
5 damn
36 hell
6 a*s
11 a-hole
6 b**tard
16 b*tch (probably in the "son of a" kind)
=======
247 invectives
74,791 words
0.33% invectives

;D


message 27: by Dwayne, Head of Lettuce (new)

Dwayne Fry | 4434 comments Mod
Micah wrote: "New novel...

45 f**k
14 f**ked
39 f**king
48 sh*t
8 bullsh*t
13 Jesus
5 damn
36 hell
6 a*s
11 a-hole
6 b**tard
16 b*tch (probably in the "son of a" kind)
=======
247 invectives
74,791 words
0.33% ..."


You sure hit that asterisk key a lot.


message 28: by J.D. (new)

J.D. Kaplan | 47 comments Owen wrote: "Micah is correct -- I meant word count on the text input. I was having trouble getting results for anything over 2-3000 words. It could be this machine. Firefox gets weird sometimes."

Ah, sorry about that. So I've run it a bunch with around 120k words and it's been fairly performant. I run it in Chrome. do you get some kind of javascript error when you run into this?

I really appreciate all the feedback--it's always a challenge to QA your own code.


message 29: by Micah (new)

Micah Sisk (micahrsisk) | 1042 comments BTW, I ran it on Chrome as well.


message 30: by Micah (new)

Micah Sisk (micahrsisk) | 1042 comments Dwayne wrote: "You sure hit that asterisk key a lot."

I know. I get a lot of grief about that from the anti-asterisk social warriors out there. **sigh** (oops, did it again!)


message 31: by Dwayne, Head of Lettuce (new)

Dwayne Fry | 4434 comments Mod
Micah wrote: "I know. I get a lot of grief about that from the anti-asterisk social warriors out there. **sigh** (oops, did it again!)"

U* w**h asks; l*** **r s*m*-**lo** too!


message 32: by Owen (last edited May 10, 2015 08:15AM) (new)

Owen O'Neill (owen_r_oneill) | 1509 comments J.D. wrote: "do you get some kind of javascript error when you run into this?"

Nothing Firefox reports (ver 34.0.5). The debugger came up with nothing. The script is merely unresponsive. (When I dump in 75K words, I don't see any hit at all on the CPU -- not sure if I should.) The upper limit seems to be ~4K words. There's no evidence it's even recognizing the input, once it exceeds that. The only thing that's interesting there is that the limit might be ~32,000 characters. I wonder if FF limits the input to a script?

Anyway, I wouldn't worry about it. This version and/or installation of Firefox does lots of screwy things.


message 33: by J.D. (last edited May 10, 2015 09:49AM) (new)

J.D. Kaplan | 47 comments Next time I play with the script I'll test it in firefox. :)

Good catch, though. thanks for taking a look at it!


message 34: by J.D. (new)

J.D. Kaplan | 47 comments I just wanted to dump this link to the repository for anyone that is interested in seeing, using or participating in that. It's on bitbucket.org, not github.com:




message 35: by Christina (new)

Christina McMullen (cmcmullen) I've pinned this as it is a great resource for authors. Thanks, JD!


message 36: by J.D. (new)

J.D. Kaplan | 47 comments Thanks :) I'm glad people find it helpful


message 37: by Dwayne, Head of Lettuce (new)

Dwayne Fry | 4434 comments Mod
J.D. wrote: "So in my first book I discovered that once I latch onto a word that I liked I sometimes tended to overuse it. I had a hard time finding anything on the web to help me manage this situation so I wro..."

I'm home. It's... way too much fun, J.D. I will probably never be able to write again. I will spend all my free time playing with this!

It doesn't seem to work well on Firefox, but it works fine on Chrome. (I think I saw a comment from Owen or Micah saying this while I was at work, but I just wanted to back it up).


message 38: by J.D. (new)

J.D. Kaplan | 47 comments I'm really glad you're enjoying it :)


message 39: by Sherri (new)

Sherri Bobzien | 11 comments Christina wrote: "That's cool! I am uber guilty of the hotspots and having characters who seem to be in a perpetual state of huffing and sighing. :)"

There's a lot of "yelling" in my book. Hey, people yelled a lot, what can I say.


message 40: by C.B., Beach Body Moderator (new)

C.B. Archer | 1090 comments Mod
Sherri wrote: "Hey, people yelled a lot, what can I say."

Loud things? ;)


message 41: by Dwayne, Head of Lettuce (last edited May 11, 2015 09:55AM) (new)

Dwayne Fry | 4434 comments Mod
J.D. wrote: "I'm really glad you're enjoying it :)"

I am. And the number one word I use a lot is... "said". No matter what story I put through your machine, "said" is, by far, the most used word. I am surprised I didn't expect that, but with my tales being so dialogue heavy, it makes a lot of sense.


message 42: by C.B., Beach Body Moderator (new)

C.B. Archer | 1090 comments Mod
Dwayne wrote: "I am. And the number one word I use a lot is... "said". No matter what story I put through your machine, "said" is, by far, the most used word. ..."

Interesting fact Dwayne: In my novel the word 'said' is never actually said. *gasp!*


message 43: by Riley, Viking Extraordinaire (new)

Riley Amos Westbrook (sonshinegreene) | 1510 comments Mod
CB wrote: "Dwayne wrote: "I am. And the number one word I use a lot is... "said". No matter what story I put through your machine, "said" is, by far, the most used word. ..."

Interesting fact Dwayne: In my n..."


Or so he said...


message 44: by C.B., Beach Body Moderator (new)

C.B. Archer | 1090 comments Mod
I will double check Riley, it might have snuck in there somewhere since the last time I looked and therefore must be deleted.

I hope it isn't, Said was one of the flag words that wasn't allowed to be in there! :)
I do appear to have a serious problem with the phrase 'text box' though... that shows up like 3000 times!


message 45: by J.D. (new)

J.D. Kaplan | 47 comments @Riley: Well played, sir. Well played...


message 46: by Micah (new)

Micah Sisk (micahrsisk) | 1042 comments CB wrote: "I hope it isn't, Said was one of the flag words that wasn't allowed to be in there..."

Hmm. You know, you might want to consider putting some dialog. [joke]

Why, oh why, would "said" be a flag word?


message 47: by J.D. (new)

J.D. Kaplan | 47 comments For those who were wondering, these are the words that are in the built in word exclusion list:

this.excludes = ["is", "it", "in", "the", "and", "a", "to", "but", "not", "or", "of", "at", "with", "was", "i", "she", "her", "my", "that", "on", "me", "for", "into", "out", "were", "you", "he", "me", "his", "on", "for", "him", "out", "we", "up"]

I just grew that little list while using and developing the tool--it doesn't reflect some attempt at a grand bit of universal knowledge :)

You can choose not to use this list by unchecking that checkbox in the configuration section at the top.

Even if you like this list as excludes, you probably have others you want to exclude--you can add those in the first text field.


message 48: by C.B., Beach Body Moderator (new)

C.B. Archer | 1090 comments Mod
Micah wrote: "Why, oh why, would "said" be a flag word"

Normally 'said' wouldn't be a flag word. I am actually quite a fan of said!

However, my novel is set inside a multi-player video game. No one actually talks as they are technically typing in what they say, and then are reading the 'Text Box' entries of others. No one ever says something like 'Oh, but you said that thing and...', they would instead type "Oh, but I just read what you typed and...'

It eliminated all instances of 'said', 'say', 'and that ilk'. The characters show emotion mostly through what they actually type, but can do special yelling, whisper, and interruption typing actions to balance it out and keep it from getting stale. :) It reads a lot like a play.

Normally I wouldn't restrict words like that, but it was to serve the narrative as a whole and keep the story feeling like a video game.

I also restricted most real life time references, measurements, and various other things that reminded the reader and characters that the real world does exist outside the game they are playing. I exchanged them to video game terms to keep it feeling internally consistent with the world.

It was actually very fun!


message 49: by K. (new)

K. Kidd | 49 comments J.D. wrote: "So in my first book I discovered that once I latch onto a word that I liked I sometimes tended to overuse it. I had a hard time finding anything on the web to help me manage this situation so I wro..."

J.D. - Thanks for sharing this word frequency tool. What a great resource. Guilty as charged on overusing the same word. "Paused" seems to be my favorite word. Who knew?


message 50: by J.D. (new)

J.D. Kaplan | 47 comments My pleasure. I'm glad you're finding it useful


« previous 1 3
back to top