Tag: Google

  • The end of media as we know it

    The end of media as we know it

    The creepy tone of the background music sets the stage for this look back at the demise of traditional media as we know it from the perspective of 2014. Amazon, Google, Microsoft, Friendster and the trend towards personalized and automated filters to help manage information flow pull down the Fourth Estate.

    The New York Times becomes a print-only newsletter for the elderly and elite.”

    The ending leaves me cold. Watch the developments over at Pegasus News as they build an alternative to this algorithmic nightmare.

  • Guessing what you’re after

    I just downloaded Firefox 1.0 this morning and with my Noia theme there’s a big blue lollipop thing right next to the address bar. Hover text says, "Type a location in the address field, then click Go"

    It looks like it’s using the Google "I’m feeling Lucky" result which I stayed away from because of the labeling ("feeling lucky? nah, I’m here doing research!"). Once I tried it though, I was amazed at how many times it found exactly what I was looking for.

  • Google Desktop Search

    In a move that took everyone by surprise, Google announced a new downloadable product that installs on your hard drive, indexes your email, Word, Excel, Powerpoint, and AIM chat logs and adds them to the Google Search results window. The expected move was that Google would launch their own, Google-centric browser but they have once again side-stepped popular wisdom and done something that new and fantastic.

    You’ll do a double-take the first time you run a search after installing Google Desktop Search. Up on top of your results, right under the paid search ads, you see links to personal email and files that contain hits on your query. Instead of bringing the web to your desktop, by putting hits on your desktop files into the Google UI it now looks (and feels) like Google has put your desktop onto the web.

    Rael Dornfest explains what’s going on behind the scenes:

    What’s actually going on is that the local Google Desktop server is intercepting any Google web searches, passing them on to Google.com in your stead, and running the same search against your computer’s local index. It’s then intercepting the Web search results as they come back from Google, pasting in local finds, and presenting it to you in your browser as a cohesive whole.

    John Battelle caught up with Marissa Mayer, Google’s director of consumer web products, and found out that the app is only 400k and runs on only 8 MB of RAM. She also says that the relevance algorithm obviously doesn’t use PageRank but does use 150 other proprietary variables (bolding, font size, etc) to determine relevance.

    Danny Sullivan writes in depth about this new tool going on to say that the Google page that you see when you launch Desktop Search is not actually on the web but is being served up by the web server that comes with the app. This is apparent when you see the address of the URL [http://127.0.0.1:4664/&s=400994545] which is a local address.

    Another benefit is the caching so that you can now quickly peek into the contents of a file without having to wait for Excel to fire up. If there are multiple copies in cache, there’s version history which can save you if you’ve overwritten a file using the same name.

    It’s still in beta so I’ll forgive the fact that it only runs on Windows and indexes only AIM chat and Internet Explorer caches but other than that, this is a most impressive product that redefines its category.

  • Google News Forever in Beta

    Why has Google News been in Beta for three years? Because as soon as they try and place ads on the Google News pages to monetize the traffic, they’ll be hit with a barrage of cease & desist letters from publishers around the world says Wired’s Adam Penenberg.

  • The Political Bias of Algorithms

    In a review of the Google and Yahoo news sites in Online Journalism Review, Ethan Zuckerman puts forward a very interesting theory as to why the alternative news sites bubble up to the top of the relevance ranking algorithms at Google News.

    “I think what you’re seeing is an odd little linguistic artifact,” said Zuckerman, former vice president of Tripod.com and now a fellow at Harvard’s Berkman Center for Internet and Society who studies search engines. The chief culprit, he theorized, is that mainstream news publications refer to the senator on second reference as Kerry, while alternative news sites often use the phrase “John Kerry” multiple times, for effect or derision. To Google News’ eye, that’s a more exact search result.

    A second possible factor, Zuckerman said, is that small, alternative news sites have no hesitancy about using “John Kerry” in a headline, while most mainstream news sites eschew first names in headlines. The inadvertent result is that the smaller sites score better results with the search engines.

  • Wonderful Web World

    Wonderful Web World

    As I look for the cross-section of schools and interesting-but-reasonably-priced places to exist (does such a thing exist in the Bay Area?) I found myself wanting for a school district map overlayed on top of a map showing available placed to live. I’ve found pieces of the puzzle:

    SF School District Map
    greatschools.org
    SF Zip Code Map

    If only I could overlay the available listings which you can pull up in realtor.com or apartments.com using a zip code then I would be set.

    In the course of looking for a tool that could tie zip codes to neighborhoods to school districts, I ran across this wonderful site by MIT Media Lab doctoral candidate, Ben Fry. His interactive Zip Code tool is one of the coolest things I’ve seen in awhile.

    UPDATE: Now 10 years later Google Maps has started to layer this information as an extension of their Google Maps service. Check out the mash-up of greatschools.org ratings and Google Maps.

    Check out more mashups at the Google Maps Gallery.

  • Google as a Gateway

    Adam Penenberg writes in Wired that there is a very real financial incentive for the NYT web site to continue to hide its stories behind a subscription wall. A $20 million/year all-you-can-eat royalty agreement with Lexis-Nexis is an awfully hard arrangement to tear up. John Battelle noodles on this idea a bit more and ponders when the attraction of differentiated revenue from individuals, finding stories on their own via Google and other search engines, will outweigh the guaranteed revenue stream from L-N. He also adds:

    What revenue stream accounts for the lion’s share of search’s margin? Advertising. That’s a one legged stool ready to tip over. As the search giants become more and more media companies, they must develop subscription services, and because users won’t want to pay for something they already believe is free (searching) search engines will have to figure out a way to become middlemen to paid content. After all, they own distribution, so they should become…distributors. Were they to execute this service in a scaled and elegant fashion, it might be viewed as a benefit – in many cases, subscribers will get more content for less than they were paying in the past (that’s the benefit of volume).

    Searching for the New York Times – Wired

    Google as a portal to premium content? Haven’t we been there before? One comment to John’s post points out that this has been AOL’s model for the past 10 years. Yahoo has continually tried to push premium services and could easily bundle in targeted content. Northern Light also blazed this trail but flamed out after a failure to bring together enough content.

    There are many ways to get to content and Google is the current flavor of the month. Of greater demand is having a single account that aggregates access fees to each site for a reasonable monthly fee. Why pay nytimes.com and wsj.com separately when you’d rather pay a single bill for unfettered access to these sites and more? Yes, Google has the distribution network but PayPal or American Express might be a better player for a unified subscription account. After setting up unified subscription fees, the next step is working with each of the major content vendors on feeding RSS feeds of their content to the major search engine vendors so that their content begins to move up in the rankings. Portions of the proceeds of the subscription fees could then go to each of the search engine vendors, paid out as a proportion of the amount of traffic they drive to the payment vendor for signup. $40/month sounds about right – we’ll call it a “global media press pass.”

    UPDATE: Cory cuts to the chase

    The NYT’s registration system and expiring pages have doomed them to google-obscurity. Wired News argues that they’ve gone from being the paper of record to a Web-era irrelevancy, and all to protect a Lexis-Nexis agreement and to bring in two to three percent of the digital division’s profits.

    Unlinkable NYT doomed to google-obscurity – boingboing
  • The Google Supercomputer

    One node in the discussion of the internet as a new platform is the meme of Google specifically as this platform. The notion gained legs with announcements of Orkut, the social software site affiliated with Google and later touched a nerve with the announcement of Gmail, Google’s online email service. This discussion was kicked off in April 2004 by Rich Skrenta, CEO of Topix.net in a well cited post, The Secret Source of Google’s Power in a post whose comments section is now outgrown the original post
    . . . expanded by Jason Kottke in his post, GooOS, the Google Operating System
    . . . referenced by Jon Udell in his Strategic Developer column,
    . . . and summed up by Tim O’Reilly

    Gmail is fascinating to me as a watershed event in the evolution of the internet. In a brilliant Copernican stroke, gmail turns everything on its head, rejecting the personal computer as the center of the computing universe, instead recognizing that applications revolve around the network as the planets revolve around the Sun. But Google and gmail go even further, showing that once internet apps truly get to scale, they’ll make the network itself disappear into the universal virtual computer, the internet as operating system.

    Jon Udell later extends his musings in a later column,

    The gigabyte slice of the Google file system available to Gmail beta testers will, in many cases, surpass the testers’ own corporate disk quotas for email.

    Put that way, one can begin to see a world where the Google index is the broader file systems that points to “things out there” where our email, web pages, and social networks are all inputs into that file system. Jon goes on to explain this world where Google owned the operating system and what such a unified file system that continually indexes everything on your local PC could do:

    Bayesian categorization: My SpamBayes-enhanced e-mail program learns continuously about what I do and don’t find interesting, and helps me organize messages accordingly. A systemwide agent that’s always building categorized views of all your content would be a great way to burn idle CPU cycles.

    Context reassembly: When writing a report, you’re likely to refer to a spreadsheet, visit some Web pages, and engage in an IM chat. Using its indexed and searchable event stream, the system would restore this context when you later read or edited the document. Think browser history on steroids.

    Screen pops: When you receive an e-mail, IM, or phone call, the history of your interaction with that person would pop up on your screen. The message itself could be used to automatically refine the query.

    I guess I’m ok with this, so long as it’s not trying to sell me ads based on its findings!

  • Google Search Appliance 2.0

    First launched in 2002, the Google Search Appliance is a rack-mounted unit designed to crawl and index intranet pages for enterprise search. Combined with OS-level integration points such as the Google Deskbar, the appliance is the bridge between an index of your PC hard drive and the internet. While Google has not yet announced an integration into the PC level index space, several third party vendors have announced the ability to add adaptive crawl technologies integrated with Microsoft Windows and Office. Most notable is Lookout which even looks like a Google knock-off.

    The GB-1001 is a rack-mounted two-unit (2U) appliance that can be licensed to search up to 1.5 million documents at a rate of 300 queries per minute.

    Our entry level license indexes up to 150,000 documents and costs $32,000 for a two-year license with hardware, software and technical support all included. Pricing scales upwards based on the number of documents.

    A list of published customers.