Archive for the ‘Uncategorized’ Category
Google Services everyone Uses
Posted by Yuvi in Uncategorized on June 20th, 2008
Chime in with the list of Google Services you use – I will compile them into a post with good looking charts (It’s all about good looks, no?) if I get enough responses
You can leave your response at FriendFeed or here (if you are not into those kinda kinky things)
First Indian on the Techmeme Leaderboard: Yuvi ;)
Posted by Yuvi in Uncategorized on May 30th, 2008
I had a sneaky suspicion…
…which I confirmed a few minutes ago:
I’m the first Indian on the Techmeme Leaderboard! Yay!
You can grab the CSV file showing all the sites that have been on the Techmeme Leaderboard, for “atleast” one day. I checked all the sites manually, and The Statbot is the only Indian site there. Yay!
I’m making a big fuss of this small thing, simply because this came at a right time – I could present this too as “proof” of extracurricular activities when I’m submitting my College Application to SSN.
Update: No, Om Malik doesn’t qualify – He’s an American Indian (Update #2: NO, I didn’t mean a Native American – In India, Indian-born American Citizens are called American Indians
)
Unbelievable: The Statbot in the Techmeme Leaderbaord!
Posted by Yuvi in Uncategorized on May 28th, 2008

Never thought I’d be there this fast, eh?
Thanks to LouisGray for noticing it way before I did. He must be part of a collective working for him…
Thanks fly out to all people who made this possible (Rob Lagesse, Louis Gray, Sudar, Rory, and those I missed)
This goes into my Extra-Curricular-Activities-That-Can-Help-Me-Get-Into-Good-College folder
TechCrunch Statistics A-W
Posted by Yuvi in Blogs, Uncategorized on May 27th, 2008
Why A-W? Because I’m pretty sure I missed some
Do point them out so that I can do a “TechCrunch W-Z Statistics” post too
Quick Stats
- Total of 7007 posts….
- …spread over 1079 days, or just under 3 years…
- …with a total of 1,977,710 words…
- …at an average of 6.5 posts a day…
- …with 282.2 words a post…
- …receiving 228,449 comments…
- …from 56,292 unique commentators…
- …with 18,440 outbound links…
- …to 4641 sites…
- …at an average of 4 links to every site
Growth
Looking at Techcrunch’s growth patterns…
Posting Frequency

Posting frequency almost doubled after May 2007, and has been increasing ever since. I believe that hiring more bloggers to be part of TechCrunch resulted in the, uh, “explosion”. Gabe Rivera explained that this(ie going Pro) was one of the reasons TC was given more weight on Techmeme.
Post Size

Although the frequency of posting has gone significantly up, their length remains almost constant (notice the almost flat Black Trendline)
Commenting Growth

The comments increased by quite a bit after Jan 06, but seem pretty constant after that.
Top Links
Here is the list of the most linked to domains from TechCrunch:
| Rank | Site | Links | Presence |
| 1 | http://crunchbase.com |
1574 |
8.54 |
| 2 | http://techcrunch.com |
670 |
3.63 |
| 3 | http://google.com |
429 |
2.33 |
| 4 | http://crunchgear.com |
343 |
1.86 |
| 5 | http://yahoo.com |
298 |
1.62 |
| 6 | http://blogspot.com |
258 |
1.40 |
| 7 | http://flickr.com |
255 |
1.38 |
| 8 | http://facebook.com |
220 |
1.19 |
| 9 | http://wikipedia.org |
162 |
0.88 |
| 10 | http://technorati.com |
155 |
0.84 |
| 11 | http://typepad.com |
124 |
0.67 |
| 12 | http://nytimes.com |
115 |
0.62 |
| 13 | http://gigaom.com |
110 |
0.60 |
| 14 | http://youtube.com |
108 |
0.59 |
| 15 | http://digg.com |
96 |
0.52 |
| 16 | http://myspace.com |
92 |
0.50 |
| 17 | http://twitter.com |
79 |
0.43 |
| 18 | http://amazon.com |
75 |
0.41 |
| 19 | http://wsj.com |
75 |
0.41 |
| 20 | http://blogs.com |
75 |
0.41 |
| 21 | http://techmeme.com |
73 |
0.40 |
| 22 | http://live.com |
71 |
0.39 |
| 23 | http://talkcrunch.com |
70 |
0.38 |
| 24 | http://microsoft.com |
70 |
0.38 |
| 25 | http://crunchboard.com |
70 |
0.38 |
Here’s the chart showing the percentage of links to TechCrunch properties (Crunchboard, CrunchBase, TechCrunch, CrunchGear, etc) and others:

Pretty good
I’ll leave you to draw your own conclusions…
Distribution of links to sites seem to follow a power-law (as most stuff online today seems to). [Note: Power law distribution, in simplified terms, means that a few sites have a lot of links, and a lot of sites have few links]

Note that the tail is pretty much indefinitely long, since I’ve included only the Top 100 sites here…
Posting Frequency Distribution
The posting frequency appears skewed towards the left:

Most days have 1-5 posts, with 2 posts being the most common. 36 days had no posts, and 29 days had more than 20 posts.
Commenting Frequency Distribution
Comments seem to follow a Normalish distribution, skewed towards the left (btw, question to better stats buffs than me – is it really normal if it is skewed?)

Most posts have 5-40 comments. The big pole at the end is not exactly at 100, but it’s because there are 252 posts with more than 100 comments. Large number of outliers, eh?
Links Distribution
The graph looks pretty weird, but it’s how blogs work if you think ’bout it:

Mosts posts have 1-3 links, while there are more posts with 0 links than 10. I think this is pretty normal for most blogs.
Top Commentators
Here’s the list of Top 25 people who have left a comment on TechCrunch:
| Rank | Commentator | Comments | Presence |
| 1 | Michael Arrington |
3195 |
1.40% |
| 2 | Chris |
1649 |
0.72% |
| 3 | Andrew |
1131 |
0.50% |
| 4 | Duncan Riley |
1110 |
0.49% |
| 5 | David Mackey |
1004 |
0.44% |
| 6 | John |
923 |
0.40% |
| 7 | Mike |
900 |
0.39% |
| 8 | Steve Ballmer |
869 |
0.38% |
| 9 | Alex |
834 |
0.37% |
| 10 | Matt |
802 |
0.35% |
| 11 | Jon |
777 |
0.34% |
| 12 | Jason |
669 |
0.29% |
| 13 | David |
659 |
0.29% |
| 14 | Paul |
641 |
0.28% |
| 15 | Dave |
614 |
0.27% |
| 16 | Steve |
604 |
0.26% |
| 17 | Allen Stern |
604 |
0.26% |
| 18 | Josh |
594 |
0.26% |
| 19 | Peter |
581 |
0.25% |
| 20 | Tom |
557 |
0.24% |
| 21 | Mark |
541 |
0.24% |
| 22 | Sean |
494 |
0.22% |
| 23 | James |
488 |
0.21% |
| 24 | Alaska Miller |
485 |
0.21% |
| 25 | Marshall Kirkpatrick |
476 |
0.21% |
(Sorry for the lack of URLs – I put in everyone who was found via Google. If you’re here, and want your url, just leave a comment)
Just as how TechCrunch links back to itself the most, Mike is the highest commentator
p.s. which is the correct term to call a person who leaves comments – commentor or commentator?
Trivia
My most favorite section!
However, this has been cut short since TechCrunch doesn’t provide the “time” of posting nor has categories or tags (why is that?)

As usual Saturday is really the “day off”, with less posts than even Sunday. Friday, as usual, is “slow news day”

Saturday’s posts are longest, and Wednesday’s are shortest. Let’s wait till Steve Gillmor’s ramblings make Sunday’s posts longer
(On a slightly pseudo-serious note, has anyone been able to understand anything Steve Gillmor says?)
Conclusion
So folks, that’s the end of it. I would release the data set (An 180 meg XML file with all the content and comments) for download, but am not sure what Mike and co would feel like. If I get-go from them, then I’ll make a post releasing the files. Till then: Any queries? Leave a comment!
SideNote: College Admissions
Things will be relatively quiet here for a week ‘coz I’m looking into college admissions. You can help too! The college I’m hoping to get into has a history of accepting students with a history of good extra-curricular activities. Now, if I get lotsa links here, my Technorati Rank is goanna improve and put up my chances of sneaking into the college so that I can feel at home with the rest of the geeks there
Introducing The Statbot
Posted by Yuvi in Uncategorized on May 1st, 2008
Welcome to the Statbot. I’m Yuvi, a 17 year old guy, and a stats freak by accident who loves it now. I’ve previously published stats about Engadget, Scoble(Linkblog), Raymond Chen, Matt Cutts, Louis Gray, TechCrunch, Digg & Techmeme. My personal blog is here, and this place is exclusively for the fun stats that I generate often. Expect stats about online “public” behavior of various popular folks on twitter, friendfeed, their blogs, flickr, etc. Also, you’ll see stats about communities - The Digg community, the Wikipedia one, the slashdotters, “farkers“, the folks who leave comments at Engadget, those who answer questions on Yahoo answers, the folks who writemaintain Firefox, those who complain at the MSDN forums - the list is endless. Wherever there is a community, I’ll measure it.
Also, if you need custom analysis of your own website/twitter/friendfeed/whatever, I can do that for a small price as well. This service will be available only from next Month onwards though (June 08).
You will also notice a few “unofficial” leaderboards popping up here in the future. Stay tuned so that you can get bragging rights.
Once I pass 50,000 “pageviews”, I’ll start accepting sponsors. That will enable me to satisfy “my” stats addiction (and yours too!), undertaking several projects that will require infrastructure (mainly bandwidth) which are out of my reach now. Also, I’m college bound, so any financial gain I make is going to be extremely useful, both for my current Indian college education and for my ambitious US Postgrad program.
A coupla thank-yous, in no particular order: Rob Lagesse, Louis Gray, Sudar, Rory, Sriram, Chuck and ofcourse my dad
Sorry to the many people I’ve missed out - you know who you are!
If you’re also an RSS Addict like me, you can subscribe to the feed. Expect posts at regular intervals.
(Technical Note: The app that generates stats is named GoodFather, after Rob Lagesse. It consumes XML, generated by a suite of applications collectively named Avee (named for my friend Avantika) GoodFather is pure VB9, while Avee is a collection of VB9 and Python (for larger sites, like Digg) scripts. Flames on my choice of platforms welcome
)
Comments? Suggestions? Advice? Flames? All welcome. There’s a dedicated suggestions page as well.
