Monthly Archives: February 2009

Drowning in the Deep Web

The concept of the semantic web is not terribly new. As humans we provide direction to the search engine to retrieve what we’re seeking. The semantic web would go beyond the direction we give it to form its own conclusions and deduce what we want. According to Wikipedia:

semantic web is a vision of information that is understandable by computers, so that they can perform more of the tedious work involved in finding, sharing, and combining information on the web.

drowning_small1

It’s difficult for search engines to process data cognitively – they’re only machines after all. The deductive reasoning research inherent in the semantic web are a long way out. Or so I thought until I read this article.

The challenges that the major search engines face in penetrating this so-called Deep Web go a long way toward explaining why they still can’t provide satisfying answers to questions like “What’s the best fare from New York to London next Thursday?” The answers are readily available — if only the search engines knew how to find them.

Now a new breed of technologies is taking shape that will extend the reach of search engines into the Web’s hidden corners. When that happens, it will do more than just improve the quality of search results — it may ultimately reshape the way many companies do business online.

Search engines rely on programs known as crawlers (or spiders) that gather information by following the trails of hyperlinks that tie the Web together. While that approach works well for the pages that make up the surface Web, these programs have a harder time penetrating databases that are set up to respond to typed queries.

This type of Deep Web searching is not only good for consumers wanting to book a flight, but it would allow businesses to cross-reference their data with research and news to come back with results that reflect political and social landscapes.

Who is pushing the boundary on this? I bet you can guess.

Google’s Deep Web search strategy involves sending out a program to analyze the contents of every database it encounters. For example, if the search engine finds a page with a form related to fine art, it starts guessing likely search terms — “Rembrandt,” “Picasso,” “Vermeer” and so on — until one of those terms returns a match. The search engine then analyzes the results and develops a predictive model of what the database contains.

In a similar vein, Prof. Juliana Freire at the University of Utah is working on an ambitious project called DeepPeep (www.deeppeep.org) that eventually aims to crawl and index every database on the public Web. Extracting the contents of so many far-flung data sets requires a sophisticated kind of computational guessing game.

I’m excited to start booking my travel.

Shedding Some Light on PaaS

When seeking to get one’s application off the ground there are many platform-as-a-service vendors that provide attractive, custom-built business process automation. By leveraging built-in infrastructure services, companies don’t have to spend their dollars on software deployment and delivery. Many vendors have emerged as market leaders, including Force.com, Bungee Labs and Coghead.

Depending on your application, some platforms have ‘flavor’and are better than others for nestling with certain apps. Other platforms are ‘flavorless’— meaning they are not predisposed to any specific types of applications. Which one is better in the long haul will become apparent over time.

164323

But relying on someone else’s platform can create an entirely new set of problems. For a budding application provider with an exceptionally healthy growth curve, the platform could have difficulty scaling to handle the multiplying number of users. Saving money on upfront deployment is one thing, but inhibiting future growth could mean missing out on future opportunity and business, and is ultimately more harmful to a new company. To illustrate this, what if Facebook was an application built on Salesforce.com? The student would surpass the teacher.

Vendor lock-in is another serious issue. PaaS offerings provide either proprietary service interfaces or proprietary development languages which tie an application to the platform. This is is sure to cause future headaches for developers.

Developers beware…

Public Me, Private Me

Drinking from a solo cup on the weekend isn’t a crime. There isn’t any shame there. I’m just not sure how I feel about my boss seeing photos of me engaging in these types of shenanigans on the weekend. Couple those photos with crass commentary by my friends and suddenly those photos from my Saturday night morph from a great time into embarrassing evidence of bad judgment.

giant-beer-cup

Everyone has a personal life. That personal life can easily become publicized if your shutter-happy friends like to document your nights out and post them on social networks. Social networks that your professional colleagues and boss are part of. This is where things get sticky.

The Public Me and the Private Me are theoretically the same. After all, I’m Me wherever I go. One could argue that as long as you’re not engaging in totally mortifying behavior there’s nothing wrong with a bleary-eyed photo or two. Ultimately the responsibility rests with you. You need to be accountable for your on-line presence. I admit that I’ve asked a friend to kindly remove a comment on a less than flattering photo of me, the lewd reference being pretty off-color.

Are your weekends and work-life mutually exclusive? How do you cope with the balance? Or do you even need to?

To read more on how to friend safely, take a peek here.

Going Digital – Are You Ready?

Back in late 2005, Congress passed the Digital Television and Public Safety Act of 2005. I find it strange that the word ‘television’and the word ‘safety’are in the same bill. Are TVs dangerous? I guess so. There’s a lot of allegations surrounding video games… Anyway, here we are 4 years later and it’s time for this thing to take effect.

The Act means means that all analog broadcasts must switch over to digital transmissions. Why the switch? Is it actually safer?

1950s_04_tv1

For starters, it t allows broadcasters to send more data more efficiently which means high quality picture and sound. Going digital also frees up the previously congested analog spectrum for mobile phones and radio use. But the biggest reason for the move is a motivator that keeps cropping up no matter what line of work you are in. Money. $20 billion dollars to be exact.

While everyone is trying to figure out the converter and vouchers they need, the Senate has made things a touch easier by postponing this ‘safety precaution.’  On Wednesday, February 4, the House voted extend the transition until June 12.

Executing this thing is a pain.