This is a text-only version of the following page on https://raymii.org:
---
Title       :   Complete word count analysis of Security Now, episode 1 trough 370.
Author      :   Remy van Elst
Date        :   09-09-2012
URL         :   https://raymii.org/s/articles/Security_Now_Word_Analyzer.html
Format      :   Markdown/HTML
---



Security Now is a podcast by Leo Laporte and Steve Gibson released on the
Twit.tv network.

Steve pays to get the podcast transcribed, and the files are up over are
[grc.com][1].

<p class="ad"> <b>Recently I removed all Google Ads from this site due to their invasive tracking, as well as Google Analytics. Please, if you found this content useful, consider a small donation using any of the options below:</b><br><br> <a href="https://leafnode.nl">I'm developing an open source monitoring app called  Leaf Node Monitoring, for windows, linux & android. Go check it out!</a><br><br> <a href="https://github.com/sponsors/RaymiiOrg/">Consider sponsoring me on Github. It means the world to me if you show your appreciation and you'll help pay the server costs.</a><br><br> <a href="https://www.digitalocean.com/?refcode=7435ae6b8212">You can also sponsor me by getting a Digital Ocean VPS. With this referral link you'll get $100 credit for 60 days. </a><br><br> </p>


I decided to run my [analyzer][3] over the complete podcast text archive. This
is from episode 001 to 371.

#### Get the files:



   for i in {001..371}; do curl http://www.grc.com/sn/sn-${i}.txt >> sn.txt; echo $i; done


#### Clean the files up:



   cat sn.txt | LC_CTYPE=C tr -cd '[:alnum:] [:space:]' > csn.txt


#### Analyze the text file:



   cat csn.txt | LC_CTYPE=C tr [:space:] '\n' | grep -v "^\s*$" | sort | uniq -c | sort -bnr > count-combined.txt


#### Result:



   ed count-combined.txt
   461930
   1,20np
   1       65548 the
   2       49919 to
   3       42284 that
   4       40759 STEVE
   5       40065 I
   6       39496 a
   7       35321 of
   8       31706 and
   9       30845 it
   10      29930 is
   11      24634 you
   12      22213 And
   13      20365 in
   14      16467 this
   15      14406 was
   16      13811 So
   17      13761 its
   18      13711 for
   19      12847 have
   20      11599 on


[Full result][4]

### Steve only



   cat sn.txt | grep "STEVE:" > stonly.txt

   cat stonly.txt | LC_CTYPE=C tr -cd '[:alnum:] [:space:]' > stonlyclean.txt

   cat stonlyclean.txt | LC_CTYPE=C tr [:space:] '\n' | grep -v "^\s*$" | sort | uniq -c | sort -bnr > sto.txt


#### Result



   ed sto.txt
   461930
   1,20np
   1       65548 the
   2       49919 to
   3       42284 that
   4       40759 STEVE
   5       40065 I
   6       39496 a
   7       35321 of
   8       31706 and
   9       30845 it
   10      29930 is
   11      24634 you
   12      22213 And
   13      20365 in
   14      16467 this
   15      14406 was
   16      13811 So
   17      13761 its
   18      13711 for
   19      12847 have
   20      11599 on


[Steve only][5]

### Leo Only



   cat sn.txt | grep "LEO:" > leoonly.txt

   cat leoonly.txt | LC_CTYPE=C tr -cd '[:alnum:] [:space:]' > leoonlyclean.txt

   cat leoonlyclean.txt | LC_CTYPE=C tr [:space:] '\n' | grep -v "^\s*$" | sort | uniq -c | sort -bnr > leoc.txt


#### Result



   ed leoc.txt
   367236
   1,20np
   1       40349 LEO
   2       30161 the
   3       25301 to
   4       24623 I
   5       23060 a
   6       19027 you
   7       17115 it
   8       16441 that
   9       15115 of
   10      13676 and
   11      12256 is
   12      9785 in
   13      8689 And
   14      8282 this
   15      7633 have
   16      7552 on
   17      7094 for
   18      6492 its
   19      6032 do
   20      5922 know


[Leo only][6]

  [1]: http://grc.com
  [2]: https://www.digitalocean.com/?refcode=7435ae6b8212
  [3]: https://raymii.org/s/articles/Word_occurrence_counter_and_analyzer.html
  [4]: /s/inc/downloads/securitynow/sn-full.txt
  [5]: /s/inc/downloads/securitynow/sn-steve.txt
  [6]: /s/inc/downloads/securitynow/sn-leo.txt

---

License:
All the text on this website is free as in freedom unless stated otherwise.
This means you can use it in any way you want, you can copy it, change it
the way you like and republish it, as long as you release the (modified)
content under the same license to give others the same freedoms you've got
and place my name and a link to this site with the article as source.

This site uses Google Analytics for statistics and Google Adwords for
advertisements. You are tracked and Google knows everything about you.
Use an adblocker like ublock-origin if you don't want it.

All the code on this website is licensed under the GNU GPL v3 license
unless already licensed under a license which does not allows this form
of licensing or if another license is stated on that page / in that software:

   This program is free software: you can redistribute it and/or modify
   it under the terms of the GNU General Public License as published by
   the Free Software Foundation, either version 3 of the License, or
   (at your option) any later version.

   This program is distributed in the hope that it will be useful,
   but WITHOUT ANY WARRANTY; without even the implied warranty of
   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
   GNU General Public License for more details.

   You should have received a copy of the GNU General Public License
   along with this program.  If not, see <http://www.gnu.org/licenses/>.

Just to be clear, the information on this website is for meant for educational
purposes and you use it at your own risk. I do not take responsibility if you
screw something up. Use common sense, do not 'rm -rf /' as root for example.
If you have any questions then do not hesitate to contact me.

See https://raymii.org/s/static/About.html for details.