<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Mind Before You Mine</title>
	<atom:link href="http://mindbeforeyoumine.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://mindbeforeyoumine.com</link>
	<description>Web Analytics without the Spin</description>
	<lastBuildDate>Wed, 28 Dec 2011 17:48:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='mindbeforeyoumine.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Mind Before You Mine</title>
		<link>http://mindbeforeyoumine.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://mindbeforeyoumine.com/osd.xml" title="Mind Before You Mine" />
	<atom:link rel='hub' href='http://mindbeforeyoumine.com/?pushpress=hub'/>
		<item>
		<title>Visits are a Wet Blanket</title>
		<link>http://mindbeforeyoumine.com/2011/11/05/visits-are-a-wet-blanket/</link>
		<comments>http://mindbeforeyoumine.com/2011/11/05/visits-are-a-wet-blanket/#comments</comments>
		<pubDate>Sat, 05 Nov 2011 17:01:46 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[Measures & Metrics]]></category>
		<category><![CDATA[site optimization]]></category>
		<category><![CDATA[KPI]]></category>
		<category><![CDATA[measures]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[Visitors]]></category>
		<category><![CDATA[Visits]]></category>
		<category><![CDATA[Voice of Customer]]></category>
		<category><![CDATA[Omniture SiteCatalyst]]></category>
		<category><![CDATA[success events]]></category>
		<category><![CDATA[serialization]]></category>
		<category><![CDATA[HIPPO]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=756</guid>
		<description><![CDATA[Beware How You Count Events! I am aware that a large segment of the HIPPO market advocate visit based metrics*, but honestly, one can get into trouble if he blindly adheres to "Visits are what count."  All I am saying is perhaps a little more thought should be taken in setting up and serializing events.  Quick and non reflective solutions can truly lead to pain down the road. <a href="http://mindbeforeyoumine.com/2011/11/05/visits-are-a-wet-blanket/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=756&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h1>Beware How You Count Events!</h1>
<p><a href="http://kraftt.files.wordpress.com/2011/10/abstract-logo.jpg"><img class="alignleft size-medium wp-image-745" title="Abstract Logo" src="http://kraftt.files.wordpress.com/2011/10/abstract-logo.jpg?w=200&#038;h=300" alt="" width="200" height="300" /></a>I am aware that a large segment of the <a href="http://optimizationtoday.com/web-analytics/what-is-a-hippo/">HIPPO community</a> advocate <a href="http://mindbeforeyoumine.com/2010/04/18/from-worst-to-best-how-to-report-metrics-measures/">visit based metrics</a>, but honestly, one can get into trouble &#8211; big trouble &#8211; if one blindly adheres to &#8220;Visits are what count.&#8221; and apply visits like ketchup to every metric entree.  In web analytics there are three basic tempos &#8211; page views, visits, and visitors (actions over multiple visits). Page views represent the raw vibrance of your web site &#8211; lots of visitor actions that are difficult at times to summarize and trend.  So to gloss over the visitor churn and their propensity to reload or revisit pages, we often apply visits to metrics to put a wet blanket, metaphorically speaking, over the fire that raging in the page view traffic.</p>
<h3><span id="more-756"></span>Serialization Counts</h3>
<p>This is called <em>serialization</em> where we attempt to remove false or duplicate counts to report only the real counts.  By selecting visit instead of instance counts or serializing events by session ids,  we eliminate most of the false counts at sacrifice of some of the true counts.  Though this approach helps stabilize metrics, it can often make the metrics insensitive to changes in business decisions and visitor responses to those actions.</p>
<h3>Don&#8217;t Curb Your Enthusiasm!</h3>
<p>For example, imagine I am a business that depends upon customers placing items into shopping carts, and checking out shopping carts to make purchases.   If I only counted one shopping cart and one purchase per visit, clearly I would get different metrics than if I allowed my shopping carts and purchase metrics go free.   If I had promotions that increased shopping cart activity and purchases, my visit normalized metrics would show a less enthusiastic view than what may be raging (under the visit blanket) on the actual web site.</p>
<p>OK &#8211; so this may seem contrived. Who would not count every purchase? On the other hand, that should be our attitude in everything we count.  Why would I not want to count the event every time it occurs regardless if it happens multiple times within a visit or once at the end of many visits?</p>
<h3>We Do It All the Time!</h3>
<p>Another example, why do I want to count just entries that start a visit and not every time the visitor comes to the site from an external referrer?  The latter can occur multiple times within a visit or not occur again till many visits (These are called &#8220;Return Visits&#8221; or &#8220;Repeat Visits&#8221; not to be confused with new and returning Visitors &#8230; duh!?!).  I may be interested in how many visits include a sign-in, but why would I not also be interested how many times a visitor signs-in during a visit?  You may assume only once, but does your data confirm that assumption?</p>
<h3>Blame It On the Solution Designer</h3>
<p>For the solution designer, the question is then &#8220;how can I tell if the event is real and not a reflection of visitor navigation churn?&#8221;.  This involves establishing a &#8220;serialization&#8221; strategy in the client side instrumentation or for Omniture&#8217;s SiteCatalyst adding a serialization key to the event.  Too often a session cookie or session-id is too readily available to apply to collection essentially making the count a visit based metric.</p>
<p>What is required is that the designer take the extra effort to understand the real event being counted, and expand his repretoire of implementation tools to either declare the event only when it is true or find and establish a more appropriate serialization key. For example, may be a transaction key for a service task would be more appropriate than a visit id.</p>
<h3>The Pain Can Be Real!</h3>
<p>A client of mine is going through the pains of having their Key Performance Indicators transition from visit based to transaction based metrics dramatically changing their view of the business but at the same time abruptly breaking from the historical data that has been collected for years.  Like a blind person who has suddenly been given sight, there is an urge to pull out their eyes and return to their old &#8220;vision&#8221;.</p>
<h3>Stay the Path: Real is Good!</h3>
<p>It is too early to reap the benefits of the new vision, so we have to proceed assuming that with a vision that reflects more accurately visitor behavior, we will be able to more accurately measure effects of our  business decisions.  Whether it is marketing campaigns, changes in web design, or attempting to understand what the customer is saying (VoC), without true visitor based measures that are sensitive to visitor based actions, it is impossible to optimize performance.  The visit based metrics were like a wet blanket on the performance measures we needed to access these changes.</p>
<h3>I&#8217;m No Einstein but &#8230;</h3>
<p>All I am saying is perhaps a little more thought should be taken in setting up and serializing events.  Quick and non reflective solutions can truly lead to pain down the road.  Though it is our desire as analyst to make the trends evident and insights pop out from simple reports, we must still keep in mind as Einstein would say: <a href="http://c2.com/cgi/wiki?EinsteinPrinciple">&#8220;The solution should be simple but not too simple.&#8221;</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/756/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/756/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/756/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/756/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/756/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/756/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/756/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/756/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/756/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/756/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/756/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/756/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/756/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/756/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=756&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2011/11/05/visits-are-a-wet-blanket/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2011/10/abstract-logo.jpg?w=200" medium="image">
			<media:title type="html">Abstract Logo</media:title>
		</media:content>
	</item>
		<item>
		<title>None Is Not An Option!</title>
		<link>http://mindbeforeyoumine.com/2011/10/22/none-is-not-an-option/</link>
		<comments>http://mindbeforeyoumine.com/2011/10/22/none-is-not-an-option/#comments</comments>
		<pubDate>Sat, 22 Oct 2011 20:19:01 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Methodology]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[actionable insights]]></category>
		<category><![CDATA[actionable reports]]></category>
		<category><![CDATA[axioms]]></category>
		<category><![CDATA[best practices]]></category>
		<category><![CDATA[business funnels]]></category>
		<category><![CDATA[Channel Management]]></category>
		<category><![CDATA[classification]]></category>
		<category><![CDATA[conversion variables]]></category>
		<category><![CDATA[correlation]]></category>
		<category><![CDATA[eVars]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Omniture Discover]]></category>
		<category><![CDATA[Omniture SAINT]]></category>
		<category><![CDATA[Omniture SiteCatalyst]]></category>
		<category><![CDATA[Plug-ins]]></category>
		<category><![CDATA[Segments]]></category>
		<category><![CDATA[sprops]]></category>
		<category><![CDATA[success events]]></category>
		<category><![CDATA[tag plans]]></category>
		<category><![CDATA[visitor segments]]></category>
		<category><![CDATA[web analytic capabilies]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=734</guid>
		<description><![CDATA["None" is a telling symptom of a problem and "None is Not an Option" encapsulates an approach to diagnosing and resolving the problem.  When reports are designed such that None is Not Option and "None" appears in a report, it becomes an important tool for diagnosing the problem.  The resulting reports not only cover all user behaviors but with discipline allow the reports to address multiple questions and with extra diligence support multiple actions.
 <a href="http://mindbeforeyoumine.com/2011/10/22/none-is-not-an-option/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=734&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>Building Reports that Cover All Behavior</h2>
<p><a href="http://kraftt.files.wordpress.com/2011/10/shattered-glass.jpg"><img class="alignleft size-medium wp-image-746" title="Shattered Glass" src="http://kraftt.files.wordpress.com/2011/10/shattered-glass.jpg?w=225&#038;h=300" alt="" width="225" height="300" /></a>In my line of work consulting with various customers I have been able to see common threads in how different companies approach web analytics and apply it to their business.  One puzzling aspect that becomes immediately obvious to see but takes longer to understand is how many reports seem to cover only a small portion of web behavior with most visitors and visits pushed into the general category of &#8216;None&#8217;.  None meaning that visitor does not belong to any of the categories for the dimension being reported.</p>
<h3>A Puzzling Predicament</h3>
<p><span class="Apple-style-span" style="font-size:16px;color:#444444;line-height:24px;">What is puzzling is: How do such reports provide useful information on visitor behavior?  The next question that comes to mind is who are those people that belong to &#8216;None&#8217; or for the person that implements the report: Did I set up the metric correctly? In brief, How many of that are None should be Something?  This has lead me to the concept of &#8220;None is not an Option&#8221; reporting, meaning None is not a category of behavior but an indication of analytic failure or incompleteness.</span></p>
<p><span id="more-734"></span>As a consultant I can typically give advice but too often not work through the details implied in the advice.  In my current position I have had the opportunity to be the designer and implementer and put this concept to practice.  As with most theory, implementation of the theory presents its own challenges and insights.  Here I will share what I have learned.</p>
<h3>Fragmented Segments</h3>
<p>Because we will be dealing with implementation I will focus on a specific tool &#8211; Adobe (Omniture) SiteCatalyst 14. The concepts can be applied to Google Analytics though it has a different scheme for defining custom variables or data dimensions. Google forces one into schemes of grouping variables into limited number of tag plans (called slots) that one has to consider the strategic implications of the data being collected.</p>
<p>With SiteCatalyst one can quickly get into the mode of treating page properties (sprops) or event dimensions (eVars) as simple fields that one collects and then stitches together later.  The unfortunate out come of this approach is that one quickly runs out of the available number of custom variables that Omniture allows, though at first they may seem inexhaustible.  On top of this, one has the rude awakening that ability to relate these various fields in reporting has limits to number of correlations or full relations that allow one to pivot or drill down from one dimension to another.</p>
<h3>Misplaced Expectations</h3>
<p>When I confront the customer with the fragmented segmentation and non-strategic value of the SiteCatalyst reports, the response back is &#8220;We can get most of what we want from Discover&#8221;.  Discover is Omniture&#8217;s data warehouse exploration tool that removes most of the irritating limits of SiteCatalyst and allows one to cross correlate all dimensions.  With SiteCatalyst 15 many (not all) of these limitations go away so one will be tempted to continue this approach without further reflection.</p>
<p>The problem is that though Discover and SiteCatalyst 15 remove limitations on how variables can be correlated and queried, they do not address the underlying concern &#8211; fragmented segments.  These are segments that may or may not have relation to each other in that they clearly represent segments that most visitors do not belong. This shows up as a large percentage of visits or visitors that belong to &#8220;None&#8221;.  On a practical level, these variables represent a waste of a perfectly good slot that could be used to collect data that covers more visitors.  So we got to do some heavy data minding in SiteCatalyst before we can do data mining in Discover.</p>
<h2>Analytic Capabilities as Strategic Plans</h2>
<p>Clearly there are situations where the data collected will apply only to a small portion of visitors or for specific tasks that the visitor performs (acquisition, on boarding, marketing, etc.)  So as an analytic solution designer one is attempting to deal with a large number of different demands from different business units for data specific to their special needs. Without a strategic plan or master solution one is forced to service these request on a one by one basis leading to ad hoc solutions for each case.  This may satisfy the customer&#8217;s immediate need but rarely provides a lasting solution because there is always the next question that arises from the data.</p>
<h3>The Fundamental Question</h3>
<p>As a data architect (web analytic professionals are called upon to perform many roles) one has to consider once the requested data is collected how will it be used and integrated into the business process. So a good followup question back to the customer when presented a list of data fields that should be collected is: If you had this data, what would you expect to find and what decisions would you expect to make resulting from the findings?</p>
<p>The back and forth in addressing this question will usually lead to development of a web analytic capability that addresses a business need rather than describe an application or sequence of pages.  Since business processes are more formalized than applications, there is good chance that the resulting analytic capability can be applied in other aspects of the business and across all the web properties of the company.</p>
<h3>How Analytic Capabilities Pop Up</h3>
<p>Analytic capabilities are the foundation of all current web analytic tools. Web analytic providers have from the very beginning looked for common needs across all their customers to provide reports that meet those needs.  Whether it is tracking marketing campaigns to the customer&#8217;s site, tracking shopping cart activity across the various stages of the acquisition process or simply evaluating the effectiveness of forms most tools provide &#8220;canned&#8221; reports that address these needs.  More recently special reports suites have popped up to assist companies in evaluating their effectiveness in social media marketing!</p>
<h3>Plug-In or Customize?</h3>
<p>This is all well and good if your business fits the model represented by the analytic capability and the reports satisfies your business needs.  However in application one still needs to adapt the capability to the specific needs of the business.  So to hedge the market, web analytic providers provide custom variables and reports to allow businesses to define their own analytic capabilities.</p>
<p>Omniture&#8217;s offering is particularly good in this respect allowing even their standard packages such as the shopping cart to customize names of variables and events to their customer&#8217;s specific requirements.  They also provide a rather large collection of &#8216;plug-ins&#8217; that add specific capabilities to their standard instrumentation (collection) script.  One that I would highly recommend is their Channel Manager plugin that results is an All Sources Report for all external referrals (paid and non-paid) to a site!</p>
<h3>Up and Down Sides of Flexibility</h3>
<p>With such flexibility one is provided great opportunity but with the very real downside of providing enough rope to hang yourself.  When using custom variables and reporting one must continue the discipline of identifying analytic capabilities that address general needs that can be adapted to specific business requirements.</p>
<p>For example, the company I am working with now has an acquisition process but does not follow a shopping cart model.  The events and special variables associated with shopping cart model can be adapted to a degree. There are products and purchases of products but the business does not refer to them as products and purchases, they have their own business terms.</p>
<p>Special KPIs (success events) and segments (eVars) must be defined which may provide answers to overall performance but the next questions of coarse are &#8220;Why are the KPIs what they are?&#8221;, &#8220;Can we improve them?&#8221;. Therefore, in this case, this means tying funnel reports to pathing reports and ensuring drill down (correlation) between KPIs and segments.</p>
<p>The result has been collection of web analytic capabilities designed specifically for the business needs but sufficient to cover all acquisition behaviors and visitors in SiteCatalyst (focused on understanding normal and daily monitoring) and Discover (focused on &#8230; well .. discovery).</p>
<h2>How to Define and Design a Web Analytic Capability</h2>
<p>As was mentioned above, developing analytic capabilities is a discipline that has to be carried over from the web analytic providers to the custom reporting.  As a discipline we should be able to describe the various activities involved in its development. To this end let us start by formalizing the concept of Web Analytic Capability.</p>
<p>A Web Analytic Capability has a well defined purpose that is usually expressed in business (not application) questions that can be addressed by the capability.  This is then translated into sets of reports and the data that must be collected to support these reports.  Another essential element that adds power to the reports are the decisions or actions that can be taken based upon the data and/or reports.  In other words, design of actionable reports.</p>
<h3>Business Requirements without Analytic Jargon</h3>
<p>The requirements are done independent of the specific analytic tool jargon.  In the case of Omniture, talk of eVars, sprops, success events, correlations, relations, pathing etc. must be expunged from the conversation.  Campaigns, call to action, key performance indicators, channels may be appropriate depending on the business. The discussion is with business owners that make decisions and data analysts that support those decisions. Therefore the requirements should be expressed in their business terms and translated later into web analytic jargon in the implementation.</p>
<h3>Universal Rather Than Particular Solutions</h3>
<p>In the solution design you are attempting to develop a capability that is &#8220;universal&#8221;. In this case universal means capabilities that cover all visitors or segments of visitors.  If the request is to track a specific link, the response should be why not track all links and provide means of selecting the specific links the customer is interested in.  If the request is to segment visitors by specific actions provide a mechanism to expand and identify other actions in the future.  In the case the of Channel Manager mentioned above, if the request is to track marketing campaigns to your site, why not track all of the sources to the site providing an All Sources Report to track everything.</p>
<p>In the back and forth between need and solution the result should an analytic capability that is understood by the business and the solution designer to the extent that the capability can be used to meet the business need beyond the specific concern of the immediate project.</p>
<h3>Universality&#8217;s Relation to None</h3>
<p>One of the implications of universality is that all visitors are categorized and None is not an option.  Even if the visitor does not belong to any of the specific segments of interest currently, place them into a segment such as &#8216;Unknown&#8217;, &#8216;Currently Not Categorized&#8217;, &#8216;Not of Interest at this Moment&#8217;. Anything but &#8216;None&#8217; or &#8216;Unspecified&#8217; which are terms used by SiteCatalyst to bucket all the counts that cannot be attributed to the specific classification or variable.</p>
<p>This &#8220;Unknown&#8221; category will serve two important purposes. First it will identify the scope of the specific parameter or segment.  That is to say the total number of opportunities that the data can be collected compared to the specific data of interest. Second it separates the opportunity from non-opportunity represented by &#8216;None&#8217;. In others words &#8216;None&#8217; represents not being able to collect the desired data and hence a valuable diagnostic tool in ensuring quality data.</p>
<p>Even if you don&#8217;t expect to collect the specific data for all visitors there are likely cases where you do expect the data to be a necessary prerequisite for a latter user action or event.  In these cases the KPI should have no attributions to &#8216;None&#8217; and if some are found you have your first clue that something is wrong and a need to track down the cause.</p>
<p>On the other side, the &#8216;Not Categorize&#8217; category can be correlated to other dimensions allowing for potential identification of new segments that can be tracked, eventually moving visitors into other categories and providing more insight into the process being observed.</p>
<h2>The Concept Put To Practice</h2>
<p>By defining web analytic capabilities you will progress from simply collecting behavior data to understanding how behavior should be reflected in the data collected.  The capability is sufficiently  abstract to provide a guide for how to address not only a specific problem but the entire class the problem represents.  This in turn provides a framework for categorizing behavior and actions that is more complete and where None is not an option.</p>
<h3>Get the Questions Straight</h3>
<p>For example,  a frequent request starts with &#8220;We have a form that the customer must complete &#8230;&#8221;.  So one might respond with &#8220;Form Analysis&#8221; but as the discussion evolves the issues have much greater scope and questions more probing than &#8220;Where is the user having troubles and abandoning the form?&#8221; Not that it is an important question but just not the first question.  The larger question is task completion: Those who start a task (form, wizard, web flow) how many complete the task?  Or task relevancy: Is the task we gave them the one they wanted to perform?</p>
<p>What type of task are they performing?  Acquisition (Cart Checkout) and On-boarding (Registration) are tasks where the main objective is getting the visitor from Start to Finish as quickly and as simply as possible without diverting and distracting the customer with other tasks.  Service tasks on the other hand are dynamic where the user has more freedom to preform many actions within a task process.  These are user convenience features that need to be tracked and understood.</p>
<p>So looking at the task as a process where the form may be an element, the more relevant and immediate questions are:</p>
<ol>
<li>How and where do users enter the task?</li>
<li>What scenarios can the user perform within the task process?</li>
<li>What has been accomplished when the task is completed?</li>
<li>Finally, where and why do users abandon the task once started?</li>
</ol>
<h3>Start with the First Question</h3>
<p><span class="Apple-style-span" style="font-size:16px;color:#444444;font-family:Georgia, 'Bitstream Charter', serif;line-height:24px;">The first question is addressed by tracking both external and internal channels so that when a user starts a task we know how they came to the site and what they viewed prior to starting the task.  One may go a step further and provide click placement tracking to understand where on the referring page the user clicked to start the task. Here the &#8220;None is Not an Option&#8221; solution is obvious: track all sources coming to the site (Paid AND Organic) and track all channels within the site.  If you are doing click placement tracking consider tracking all links.   When this is accomplished, None means a channel or action that is not tracked.</span></p>
<h3>Triage Business from Analytics</h3>
<p><span class="Apple-style-span" style="font-size:16px;color:#444444;font-family:Georgia, 'Bitstream Charter', serif;line-height:24px;">The second question is more specific and takes more effort to analyze and abstract into an analytic capability.  The business owner wants to know everything and the significance of every field entered and action taken.   So triage most be done to separate what represents business concerns and what reflects user behavior.  The data for the latter can be collected with web analytics with the former best left to back end transaction engines, CRM and other business data warehouses.  The issue here for the solution designer is establishing a way to connect the data on the backend to &#8220;color&#8221; business facts with user behavior data.</span></p>
<h3>Establish Realistic Expectations</h3>
<p>Categorizing the scenarios preformed within a task (especially a service orient task) has additional challenges.  Is this first time the visitor performed the task? If they are returning how do their previous visits effect the current visit?  When is the intent of the visitor revealed &#8211; at the start, during or at completion?  If they select certain options or perform specific actions within the task, is it truly of strategic importance to capture these machinations or can this be part of what is reported at the completion of the task?</p>
<p>There are many more questions that can be asked but from a web analytic capability perceptive, the NINAO options are:</p>
<ol>
<li>At the start of the task, have I captured everything I know about the user that will define or effect the user experience during the task process?</li>
<li>At the completion of the task, have I captured everything that was accomplished during the process hopefully available on the confirmation of the successful completion of the task?</li>
<li>For actions within a task process, how can these be best captured: Milestone (Step) events, click events, or changes in content generating page view events (even if the action does not result in an HTML page transition)?</li>
</ol>
<h3>Mockup Reports to Confirm Design</h3>
<p>Once having defined what you believe covers all entries and exits from the process cross check this with your measures (events) and how they will be segmented and broken down.  This exercise will result in mockup reports that can be reviewed with the business owner and analysts.</p>
<p>In the back and forth exchange, the reports can be tuned and simplified such that what is important to understand is clear and presented succinctly in the proposed report and not encumbered by user machinations that can divert attention from what is important to understand and follow at a high level.</p>
<h3>Now the Other Stuff</h3>
<p>Now you can consider Form Analysis, Fallout Reports, Page Pathing Reports that will help to answer questions that will arise from the reports you have defined for task analysis.  From my experience, I provide Funnel (or Waterfall) Reports that provide a means of understanding the overall performance to business owners but then can be broken down and analyzed at the next level by data analysts using these more detailed reports.</p>
<h2>From Solution Design to Implementation</h2>
<p>Once solution design is in place, you can now consider how the design will be implemented in a particular web analytic tool. The solution design should identify how the data will be collected and the options for configuring the web analytic capability to a specific solution, in this case task.  The analytic capability will have to make public to the web designer or application developer what is expected for the data collection.</p>
<p>For example, with Form Analysis it is expected that the web designer use the standard HTML elements that make up a form.  But with the Dojo and other web toolkits this is not as straight forward so the standard web analytic provider form analysis plug-in will not work.  Even if it does, how are the forms identified for analysis and the fields named for reporting?</p>
<h3>Application Execution Context API</h3>
<p>This leads to identifying and formalizing the API between the web application and web analytics and how the web analytics gets access to the application execution context (AEC) on the server or client.  With standard HTML the AEC is the Document Object Model (DOM) and HTML Event Model but in most cases this not sufficient to collect all the data required so there must be mechanism to bring the relevant business data into web analytic stream.  Traditionally this is done by including static analytic tags on each page that the web analytic instrumentation collects and sends to the web analytic provider (WAP).</p>
<h3>Dynamic Instrumentation and Data Collection</h3>
<p>However more and more it is necessary to dynamically collect and process the data on the client and at times maintain state between page transitions in the web analytic instrumentation.  How this is addressed is a much larger topic that is beyond the scope of this post.  However it is in this environment of static tags that we often must consider our tag plans.</p>
<p>When considering the number and configuration of variables and properties that make up a tag plan for a web analytic capability, one has to consider how these will all be brought together in the various reports to the customer.   The application or web developer is usually willing to provide fields of data that describe a page or context for an action, but it is the web analytic implementation that must stitch these various fields into meaningful tag variable assignments.</p>
<h3>Now the Trouble Begins</h3>
<p>For example, in the task web analytic capability defined above, clearly each task must be identified  by a task name that can be attributed to Task Start and Task Completion events.  For SiteCatalyst this means an eVar that can be attributed to Start and Completion success events assigned to the appropriate pages &#8211; start form and confirmation page.  Now we have different segments we need to attribute to these same events but also must relate to the task name.  Of course each task name will have their own list of scenario segments as well as task completion out comes.</p>
<p>Now the vulgarities of SiteCatalyst come into play.  I can make the Task Name eVar have full relations and then define eVars for each of the segments (or stitch everything together is Discover where all eVars have full relations).  Then from a Task Name report I can break down the results by any of the segments (eVars) I have defined. With this approach I quickly run out of eVars.  Even if I am &#8216;strategic&#8217; in my eVar assignments such that multiple tasks can share the same segment, the eVar will typically cover only a portion of the visitors and introduces fragmentation and None as an Option for those segments.</p>
<h3>A Way Out of Trouble</h3>
<p>Fortunately SiteCatalyst provides an alternative solution &#8211; classification.  This is a great feature that exploits the <a href="http://wp.me/pqZC6-P" target="_blank">Fifth Axiom of Web Analytics</a>: &#8220;Data that is different is counted as different.&#8221;  In this case, the Task Name eVar includes not only the Task Name but everything we know about the user at the start of the task as well as at the completion of the task!</p>
<p>Using Omniture&#8217;s SAINT, these variables are keys that can be classified on the backend to represent all of the segments I have identified in my solution design. So instead of dozens of eVars I have one and now have reports that drill down from the task name.  With additional discipline all the tasks can be reported and broken down in the same report!</p>
<p>The trick here is to declare the eVar with each event (they are after all &#8220;event variables&#8221; meaning variables specific to the event).  Then classification ties the segments that include both all events.  For example, task_name | user_senarios is declared at task Start and task_name | user_senarios | task_outcomes declared at task completion. Then through classification task_name and user_senarios that are known at the start can be attributed to task Completion along with the task_outcomes.</p>
<h3>Now Pile On</h3>
<p>The implementation approach is straight forward with what I call Piling On. All the parameter values that one would normally assign to different variables in a static tag (or AEC) are concatenated together (with proper separators) into one value assigned to an eVar or sprop.  The key is to make the Key differentiate all the possible variations in segments you need.</p>
<p>Once you have defined the classification structure in SiteCatalyst (ADMIN &gt; conversion &gt; classification), you can retrieve the template and key values from SAINT in a tab delimited file; separate into your desired segments via Exel (via those cleaver separators you inserted at collection), and then upload back to SAINT. SAINT then classifies all of the data collected regressively (meaning all the data all the way back to the beginning) so you can change or modify segments through all report periods.</p>
<p>The result is that every visitor and visit gets assigned to proper segments, the segments are correctly correlated and None is Not an Option!!!  In fact, None in this case means keys (eVar values) that have not been classified.  So occasionally you need to retrieve the unclassified keys from SAINT (using the Null field option in the Export), rinse and repeat.</p>
<h2>Summary</h2>
<p>&#8220;None&#8221; is a telling symptom of a problem and &#8220;None is Not an Option&#8221; encapsulates an approach to diagnosing and resolving the problem.  When reports are designed such that None is Not Option and &#8220;None&#8221; appears in a report, it becomes an important tool for diagnosing the problem.  The resulting reports not only cover all user behaviors but with discipline allow the reports to address multiple questions and with extra diligence support multiple actions.</p>
<p>In the case of Omniture&#8217;s SiteCatalyst, the reports are available to a wider audience than what can be provided by Discover.  Though most serious analysts work exclusively within Discover to avoid the rather arbitrary limits of SiteCatalyst, by providing strong reports in SiteCatalyst the work performed in Discover is even further empowered.</p>
<p>First, fragmented segments that are likely loosely coupled are now tightly correlated in both environments, allowing drill-down in SiteCatalyst that can be further broken down in Discover. Second, insights discovered in Discover can be more directly related to SiteCatalyst reports, allowing business owners to more quickly grasp and exploit the insight.  Finally, the data mining is more effective once the data has been mind, because the data has the facts (keys) already rather than stitching the facts together from potentially disparate fields.  Hence, Mind Before You Mine.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/734/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/734/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/734/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/734/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/734/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/734/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/734/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/734/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/734/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/734/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/734/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/734/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/734/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/734/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=734&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2011/10/22/none-is-not-an-option/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2011/10/shattered-glass.jpg?w=225" medium="image">
			<media:title type="html">Shattered Glass</media:title>
		</media:content>
	</item>
		<item>
		<title>Oracle Announces &#8220;A Cloud in a Box&#8221;</title>
		<link>http://mindbeforeyoumine.com/2010/10/12/oracle-announces-a-cloud-in-a-box/</link>
		<comments>http://mindbeforeyoumine.com/2010/10/12/oracle-announces-a-cloud-in-a-box/#comments</comments>
		<pubDate>Wed, 13 Oct 2010 00:33:32 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Elastic Computing]]></category>
		<category><![CDATA[Exalogic]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=614</guid>
		<description><![CDATA[recently Oracle CEO Larry Ellison announced Oracle’s entry into the cloud-computing arena with Exalogic Elastic Cloud, that has been dubbed “Cloud in a Box”. Ironically the Cloud is supposed to eliminate the Box, so technically this is an oxymoron following current definitions and trends in Cloud Computing. So how can Oracle claim to have a cloud in a box? <a href="http://mindbeforeyoumine.com/2010/10/12/oracle-announces-a-cloud-in-a-box/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=614&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>An Oxymoron Worth Making?</h2>
<p><a href="http://kraftt.files.wordpress.com/2010/10/img_0398.jpg"><img class="alignleft size-full wp-image-707" title="Cloud Boxing" src="http://kraftt.files.wordpress.com/2010/10/img_0398.jpg?w=640" alt=""   /></a>More recently Oracle CEO Larry Ellison announced Oracle’s entry into the cloud-computing arena with Exalogic Elastic Cloud, that has been dubbed “Cloud in a Box”<a href="#_edn1">[i]</a>.  Ironically the Cloud is supposed to eliminate the Box<a href="#_edn2">[ii]</a>, so technically this is an oxymoron following current definitions and trends in Cloud Computing<a href="#_edn3">[iii]</a>.  For example, elastic is interrupted as being able to scale computing resources to demand by incorporating more virtualized servers in the cloud and hence only pay for resources consumed.  A box on the other hand is a set configuration of resources that must be purchased (as a box) and remains idle as a box until peak demand utilizes its entire potential.  So how can Oracle claim to have a cloud in a box?<span id="more-614"></span></p>
<h3>Cloud from a Different Perspective</h3>
<p>To answer this, we will need to expand our perspective and look at architecture from a different point of view, perhaps wave our hands at certain points.  First, the information systems space that builds boxes, which includes IBM, Oracle, Teradata, and more recently HP, has typically differentiated their products into warehouses and appliances.  The latter are data boxes configured to perform one task very fast and very efficiently, and once setup and integrated into a business process, the appliance does not change.  This would include transaction engines, content management, customer relation databases, and most financial databases. (As I have pointed out before, one does not go mining for profits but follows well defined accounting procedures with set data structures).</p>
<p>These appliances have critical roles in the business processes but often the boundary between minding the data and mining the data can be very fuzzy and broad.  Though you would not mine financial data for profits, you would mine the same data for insights affecting those profits.   Though you might have a content management system supporting your authoring and serving your content, you may want to feedback information as to who is viewing the content and their reaction to that content, perhaps target specific content dynamically to segments of visitors as they are discovered in real time.</p>
<p>Normally all this financial, transactional and presentation data is pulled off to a data warehouse where it can be integrated, aggregated and analyzed requiring great flexibility in augmenting, querying and presenting the data.  Instead of fixed static queries that are optimized for retrieval or storage in an appliance, the queries are free formed and changing within a data warehouse such that a different form of optimization must take place to search and aggregate large (terabyte to petabyte) sized data warehouses.  In Oracle’s case the appliance is Exadata with fast storage and access and the warehouse is Exalogic with computational power to perform functions (logic) on the data. (To make things more confusing the latter is sometimes called a “data warehouse appliance” emphasizing that the whole thing is configured as box right out of the box so to speak, but we have that covered  with the term “box”.)</p>
<h3>Boxing in the Cloud</h3>
<p>But this is only part of the story. Until recently, these have been separate offerings with the appliances operating in real-time and data warehouses operating in the background, typically as batch processes, at their own “more strategic” tempo.  However as the data from different sources becomes more inter-related, such as forming social networks among thousands of users every minute, and requires near real-time warehouse-like processing in response, such as immediately using the new networks for post messages.  The warehouse is no longer a very large database but more a massive parallel computing system that must process and access large amounts of inter linked data in real-time.</p>
<p>Exalogic is Oracle’s attempt at massive parallel computing (MPC) in a box configuring up to 360 processing cores<a href="#_edn4">[iv]</a> to operate on a computational problem.  The claim by Ellison is that as a server the box can process 1 million requests per second sustained and that two of these boxes would be sufficient to service all Facebook customers.  However, there are measures more important that web hits to be considered and Facebook’s requirements illustrate this.  Essentially Oracle’s Exalogic is 360 web servers built on top of an Oracle Grid RDBM.  The same processors managing requests are also performing the database queries.  Is there a problem with this?</p>
<h3>Sequential Processing Giant Goes Concurrent</h3>
<p>Oracle has practically no experience in parallel computing as it applies to database design and implementation.  Oracle is after all the archetype for communicating sequential processing where grid means communications among databases where everything is shared and potentially locked down.</p>
<p>Interestingly their new CEO, Mark Hurd (formally ousted CEO at HP)<a href="#_edn5">[v]</a>, does have parallel computing exposure through Teradata as CEO of NCR.  So this is an interesting twist that has not been sufficiently exposed over the HP / Oracle brouhaha.  I would consider this a clue that parallel computing is at the core of the offering and not simply a box with 360 virtualized servers.  Teradata on the other hand has over 30 years corporate experience applying parallel processing to computation and 20 years developing parallelized database systems based upon Share Nothing Architectures<a href="#_edn6">[vi]</a> (the antithesis of what Oracle has been developing over that same period).</p>
<p>To me this represents a subtle but significant core shift in computing whether cloud or not.  Oracle and IBM have deep pockets to acquire the expertise that Teradata has developed over the long haul.  However, I would suggest that Oracle go outside of Silicon Valley because the concept of concurrent programming is counter to the current culture there.  In the end, it is not the hardware but the operating system and the “programming” paradigm that can exploit the potential of the hardware that will be the breakthrough and provide the change needed to succeed.</p>
<h3>Elastic Box: A Cloud by a Different Name</h3>
<p>Returning to the original question with respect to elastic.  Though Oracle’s offering is a box, elasticity within the box will depend on how well demand from various sources can be managed to allow flexible allocation of resources within that box.  Typically architectures assume that each box is dedicated to a single function – web server, transaction engine, financial database, etc.  Can all these functions be performed within the same box such that resources can quickly roll over to the function having the most demand at that moment and therefore ensuring that the box is effectively utilized while satisfying all demand?  This assumes that the combined demand on all the individual boxes can be smoothed when combined into a single box.</p>
<p>This is the overall assumption within an elastic cloud but demand is spread over a larger number of functions and wider global demand.  The flex point for cost of computing from external to internal cloud is the point where demand is well defined and smooth and the cost of owning and maintaining the equipment is much less than the utility cost of resources.  In other words, when elasticity is not an over-riding factor.</p>
<p>In Oracles case and for others in this space, removing the need to allocate function to separate boxes and providing more flexibility in reconfiguring or expanding functions can lead to further cost savings and resilience over the traditional IT architectures.  In this way the box acts as an internal cloud and can be argued to be elastic, but only if you change the currently accepted definition<a href="#_edn7">[vii]</a>.</p>
<h3>One Cloud of Many Boxes</h3>
<p>For the cloud purest that believes that the cloud is a monolithic computer, these kinds of boxes will increasingly populate the network to achieve that vision.  It will only occur when the programming paradigm allows “resources” to follow “apps” much as workers follow jobs within companies.  Currently we see apps allocated to resources and programmed specifically for those resources.  So the shift is big &#8211; a shift that I doubt Oracle has not even started to make.</p>
<h2>Works Cited</h2>
<p>Abel, J. C. (2010, 7-September). <em>Mark Hurd Lands At Oracle, HP Sues</em>. From Wired:  http://www.wired.com/epicenter/2010/09/former-hp-ceo-mark-hurd-lands-on-his-feet-at-oracle/</p>
<p>Beal, B. (2010, 20-September). <em>Oracle Exalogic is the company&#8217;s &#8220;cloud in a box&#8221;</em>. From  SearchOracle:  http://searchoracle.techtarget.com/news/2240022616/Oracle-Exalogic-is-the-companys-cloud-in-a-box</p>
<p>Bhatia, A. (2007, 10-December). <em>Shared Nothing Architecture</em>. From Toolbox.com:  http://it.toolbox.com/wiki/index.php/Shared_Nothing_Architecture</p>
<p>Closson, K. (2007, 9-August). <em>Nearly Free or Not, GridSQL for EnterpriseDB is Simply Better Than  Real Application Clusters. It is Shared-Nothing Architecture After All! </em>.  From Kevin Closson’s Oracle Blog:  http://kevinclosson.wordpress.com/2007/08/09/nearly-free-or-not-gridsql-for-enterprisedb-is-simply-better-than-real-application-clusters-it-is-shared-nothing-architecture-after-all/</p>
<p>Considine, J. (2010, 22-September). <em>&#8220;Cloud in a Box&#8221; Gets Boxed In </em>. From Cloud Switch:  http://www.cloudswitch.com/page/cloud-in-a-box-gets-boxed-in</p>
<p>Dignan, L. (2010, 20-September). <em>Oracle&#8217;s Exalogic box: Cloud washing at its best?</em> From ZDNet:  http://www.zdnet.com/blog/btl/oracles-exalogic-box-cloud-washing-at-its-best/39343</p>
<p>Snow, R. (2010, 5-October). Teradata Advances Personal Marketing Using Analytics. From Ventena Research: http://www.ventanaresearch.com/blog/commentblog.aspx?id=3984</p>
<p>Williams, A. (2010, 30-September). <em>Why the Oracle Exalogic Cloud is Not Elastic </em>. From ReadWrite  Cloud:  http://www.readwriteweb.com/cloud/2010/09/why-the-oracle-exalogic-cloud.php</p>
<div>
<hr size="1" />
<div>
<p><a href="#_ednref">[i]</a> (Beal, 2010)</p>
</div>
<div>
<p><a href="#_ednref">[ii]</a> (Dignan, 2010)</p>
</div>
<div>
<p><a href="#_ednref">[iii]</a> (Williams, 2010)</p>
</div>
<div>
<p><a href="#_ednref">[iv]</a> Oracle Exalogic is powered by Sun’s server and system technology and contains up to 360 CPU cores, 2.8TB of RAM, and 40TB of storage in a single rack of equipment. The whole configuration priced just over 1 Million dollars.</p>
</div>
<div>
<p><a href="#_ednref">[v]</a> (Abel, 2010)</p>
</div>
<div>
<p><a href="#_ednref">[vi]</a> (Bhatia, 2007)  For Teradata&#8217;s specific product offering called Teradata Relationship Manager (TRM) see <a href="http://www.ventanaresearch.com/blog/commentblog.aspx?id=3984">(Snow, 2010)</a>.</p>
</div>
<div>
<p><a href="#_ednref">[vii]</a> (Considine, 2010)  This is an important and in depth critic of Oracle’s offering from an informed recognized source in cloud computing.</p>
</div>
</div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/614/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/614/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/614/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/614/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/614/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/614/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/614/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/614/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/614/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/614/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/614/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/614/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/614/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/614/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=614&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/10/12/oracle-announces-a-cloud-in-a-box/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" />
		<media:content url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" medium="image">
			<media:title type="html">HAL Cloud</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/10/img_0398.jpg" medium="image">
			<media:title type="html">Cloud Boxing</media:title>
		</media:content>
	</item>
		<item>
		<title>Breaking: Oops, Google Wave a Fail!</title>
		<link>http://mindbeforeyoumine.com/2010/10/11/breaking-oops-google-wave-a-fail/</link>
		<comments>http://mindbeforeyoumine.com/2010/10/11/breaking-oops-google-wave-a-fail/#comments</comments>
		<pubDate>Tue, 12 Oct 2010 02:56:16 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Google Wave]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=609</guid>
		<description><![CDATA[Has the Brain’s Plan Been Foiled? OK &#8211; So its not quite breaking news. It took awhile to recover from the shock. For the moment just play along. Google announced recently that they are suspending further development of Google Wave[i]. &#8230; <a href="http://mindbeforeyoumine.com/2010/10/11/breaking-oops-google-wave-a-fail/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=609&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>Has the Brain’s Plan Been Foiled?</h2>
<p><a href="http://kraftt.files.wordpress.com/2010/10/img_0393.jpg"><img class="alignleft size-full wp-image-698" title="IMG_0393" src="http://kraftt.files.wordpress.com/2010/10/img_0393.jpg?w=640" alt=""   /></a>OK &#8211; So its not quite breaking news. It took awhile to recover from the shock.  For the moment just play along.</p>
<p>Google announced recently that they are suspending further development of Google Wave<a href="#_edn1">[i]</a>.<br />
<span id="more-609"></span></p>
<blockquote><p>“Was it a social network or a collaboration tool on steroids? Nobody could quite agree,      and it&#8217;s one reason for the demise of Google Wave.” Georgina Swan, PCWorld<a href="#_edn2">[ii]</a></p>
<p>“&#8230; admitting failure and moving on is another key lesson in  managing innovation.” &#8212; Karim R. Lakhani, Harvard Business Review<a href="#_edn3">[iii]</a></p>
<p>“Still, I think Google&#8217;s public celebration of its recent product failures strongly suggests that its growth story is far from over.” John Keeling, The Motley Fool<a href="#_edn4">[iv]</a></p></blockquote>
<p>I can hear the Brain going through his checklist with Pinky:</p>
<ul>
<li>Real time interaction among networked individuals – Check!</li>
<li>Uber mash up and convergence of different social interaction metaphors – Check!</li>
<li>Uber sharing and collaboration allowing networks formed in Web 2.0 to accomplish objectives in Web 3.0 – Check!</li>
<li>A platform that can be expanded by third parties to provide capabilities that has yet to be envisioned – Check!</li>
<li>A really really really smart spell checker<a href="#_edn5">[v]</a> – Check?!</li>
<li>All wrapped in an obsolete desktop metaphor with a UI that is so ugly and complex that only a geek could love it<a href="#_edn6">[vi]</a> – Check! What??? <em><strong>Narf!!!</strong></em></li>
</ul>
<h3>Geeks: A mass media they make &#8211; <em><strong>Not!</strong></em></h3>
<p>Anyone of these points could become the next media gestalt that changes how we operate in the world at large. However these capabilities, though requiring more computational capability, are more incremental to what has already been accomplished in Web 2.0.</p>
<p>Collaboration already can be done with Webex and combining email with documents is not that different from Google Groups. In the end the mashup was too complex for anyone but a techno geek to comprehend.  To bring social networks into email and messaging has already been done by Facebook and MySpace with Twitter and Tumblr addressing the need to express and publish thoughts and ideas.</p>
<p>To return to a desktop metaphor of work groups and browsers just doesn’t seem to fit the persona of a revolution.  A better spell checker and watching someone correct your text while you are typing is not sufficient change and in some ways creepy.</p>
<p>It is the masses of social beings (which excludes many of us geeks) accepting and adopting a technology that creates a medium.  Make no doubt about it, you have to become a medium phenom if your goal is to take over the world.  You can&#8217;t be like Microsoft, just coming out with new versions of the old stuff, or in Googles case tweak the search algorithm. That is the direct road to has been.</p>
<p>The untapped potential is the networks created in Web 2.0 can be used to accomplish things…together…in Web 3.0!  Working together is not a new thing, but potentially disruptive if facilitated and accomplished on a massive scale.  Unfortunately, Google is not able to exploit this potential, whereas Facebook and Twitter have been able to at least capture the public’s imagination, and Apple has been able to focus on its demands.</p>
<h3>Google must Face the Book (and the Music)</h3>
<p>If Google is the Architect of Cloud 2.0, then Facebook is its face.  Google after all this appears as clueless as the Architect in the Matrix (don&#8217;t even consider Facebook as the Oracle &#8230; maybe Apple or Salesforce). Poor little Google caught between Facebook and Apple looks as out of place and lumbering as Microsoft and Yahoo at a search conference.  Somehow Google has to defend search and get Social as Microsoft had to defend the desktop and get the Cloud.  From Microsoft’s experience, it does not look good for Google nor the Silicon Valley techno-geeks that have been cheering on Google&#8217;s technological arrogance<a href="#_edn7">[vii]</a></p>
<p>In fact there is an ironic twist to our Pinky and the Brain metaphor. Facebook and Twitter, the scorned of the Valley Wag, are viewed as Pinky against Google as the Brain. Though he plays the part of the simpleton, it is Pinky that has the sardonic insights that cut through to the core of the Brain&#8217;s maniacal self-absorption. Will all the brain power at Google be able to work through it&#8217;s own short sightedness?  The indications are no &#8211; but that&#8217;s another episode.</p>
<h3>Don’t Count the Brain Out Just Yet</h3>
<p>To be fair, the Brain is tenacious both in cartoon and corporate forms.  Google still has the computational power and experience that Microsoft and Yahoo are only now attempting to match.  The social media companies – Facebook, Twitter, Diggs – are preoccupied with meeting demand, with Facebook able to expand application of its network to different services.  But can it all be done in blue with not even a glance to user experience.</p>
<p>To many pundits, from the Wave experience, Google has acquired capabilities that will eventually be spread to other offerings including a social offering.  So perhaps let Google regroup and keep an eye on them for another breakaway plan.  After all, Android is number one iDevice OS and Chrome is taking share from Microsoft IE.  That’s success. Right?</p>
<p><a href="http://wp.me/pqZC6-9U">I wonder what others are doing in the meantime?</a></p>
<div>
<h2>Works Cited</h2>
<p>Ars Technica Staff. (2010, 12-August). <em>Google Wave: why we didn&#8217;t use it</em>. From Ars Technica:  http://arstechnica.com/software/news/2010/08/google-wave-why-we-didnt-use-it.ars</p>
<p>Hölzle, U. (2010, 4-August). <em>Update on Google Wave</em>. From The Official Google Blog:  http://googleblog.blogspot.com/2010/08/update-on-google-wave.html</p>
<p>Hall, S. (2010, 10-August). <em>Google Wave: Knowing When to Fold &#8216;Em</em>. From IT Business Edge:  http://www.itbusinessedge.com/cm/community/news/bam/blog/google-wave-knowing-when-to-fold-em/?cs=42698</p>
<p>Keeling, J. (2010, 9-August). <em>The Real Story Behind Google&#8217;s Product Failures</em>. From The Motley  Fool:  http://www.fool.com/investing/high-growth/2010/08/09/the-real-story-behind-googles-product-failures.aspx</p>
<p>Mehrotra, P. (2010, 12-August). <em>Will Google get it right this time?</em> From HindustanTimes:  http://www.hindustantimes.com/Will-Google-get-it-right-this-time/Article1-584772.aspx</p>
<p>Swan, G. (2010, 6-August). <em>Google Wave Slayed by Facebook and Twitter</em>. From PCWorld:  http://www.pcworld.com/article/202722/google_wave_slayed_by_facebook_and_twitter.html?tk=hp_new</p>
<hr size="1" />
<div>
<p><a href="#_ednref">[i]</a> <a href="http://googleblog.blogspot.com/2010/08/update-on-google-wave.html">(Hölzle, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[ii]</a> <a href="http://www.pcworld.com/article/202722/google_wave_slayed_by_facebook_and_twitter.html?tk=hp_new">(Swan, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[iii]</a> <a href="http://www.itbusinessedge.com/cm/community/news/bam/blog/google-wave-knowing-when-to-fold-em/?cs=42698">(Hall, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[iv]</a> <a href="http://www.fool.com/investing/high-growth/2010/08/09/the-real-story-behind-googles-product-failures.aspx">(Keeling, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[v]</a> This was one of three points that Google felt was a major accomplishment in the Wave development, hinting at the use of semantics to help improve spell check suggestions.  The implications are astounding – an entire semantic web dedicated to spell checking.  The ramifications are mind-boggling: instead of the dozen suggested words that have nothing to do with what you were intending, it comes up with one or two suggestions that do apply – based upon the context of what you are writing.  Hopefully it picks up changes in discussions that occur often in real time better than the contextual ads that are places at the side of the text.  It is something I would not of thought of right off the top when envisioning a semantic web.  But having thought about it now – I can sort of see their point – not!  Not to say that a really good spell checker would be nice. Snark.</p>
</div>
<div>
<p><a href="#_ednref">[vi]</a> I am paraphrasing a rather decent breakdown of why Google Wave was not accepted from the editors of Ars Technical <a href="http://arstechnica.com/software/news/2010/08/google-wave-why-we-didnt-use-it.ars">(Ars Technica Staff, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[vii]</a> <a href="http://www.hindustantimes.com/Will-Google-get-it-right-this-time/Article1-584772.aspx">(Mehrotra, 2010)</a></p>
</div>
</div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/609/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=609&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/10/11/breaking-oops-google-wave-a-fail/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" />
		<media:content url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" medium="image">
			<media:title type="html">HAL Cloud</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/10/img_0393.jpg" medium="image">
			<media:title type="html">IMG_0393</media:title>
		</media:content>
	</item>
		<item>
		<title>Announcement: Launching LOAF with JAM Project!</title>
		<link>http://mindbeforeyoumine.com/2010/10/10/announcement-launching-loaf-with-jam-project/</link>
		<comments>http://mindbeforeyoumine.com/2010/10/10/announcement-launching-loaf-with-jam-project/#comments</comments>
		<pubDate>Sun, 10 Oct 2010 21:01:19 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Asides]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[concurrency]]></category>
		<category><![CDATA[Elastic Computing]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[LOAF]]></category>
		<category><![CDATA[Map Reduce]]></category>
		<category><![CDATA[Web Architecture]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=668</guid>
		<description><![CDATA[Just launched on Project Hosting at Google the loafwithjam project.  LOAF stands for Live Object Application Framework and supports the Actor Model of Concurrency introduced in my last post. I have upload the documentation for codes that are similar to what have been in operation for web analytics collection and processing successfully for many years. <a href="http://mindbeforeyoumine.com/2010/10/10/announcement-launching-loaf-with-jam-project/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=668&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>Actors without the Lisp.</h2>
<p><a href="http://kraftt.files.wordpress.com/2010/10/img_0361.jpg"><img class="size-full wp-image-692 alignleft" title="Loaf w/ Jam Poster" src="http://kraftt.files.wordpress.com/2010/10/img_0361.jpg?w=640" alt=""   /></a>Just launched on Project Hosting at Google the <a href="http://code.google.com/p/loafwithjam/">loafwithjam</a> project.  LOAF stands for Live Object Application Framework and supports the Actor Model of Concurrency introduced in my last <a href="http://mindbeforeyoumine.com/2010/10/08/cloud-computing-needs-a-new-computing-paradigm/">post</a>.  I have upload the documentation for codes that are similar to what have been in operation for web analytics collection and processing successfully for many years. The intent is to initiate a Live Object in a context server when a visitor first arrives at a site and process the events of that visitor in real time.  Hence the name: Live Objects.<span id="more-668"></span></p>
<p>This is an educational project for using the Actor Model of Concurrency in implementing Java Web Services that are linearly scalable over multiple servers and processors. As I described before: This is Hadoop on steroids, Map Reduce taken to Nth degree using Share Nothing Architecture approach.</p>
<p>Recently, my friend Mark (from <a href="http://mindbeforeyoumine.com/2010/03/15/a-day-being-mark/">A Day Being Mark</a>) pointed out to me that Actors have been making a big splash in the Lisp world again with the actors library for <a href="http://antoniogarrote.wordpress.com/2010/09/03/jobim-an-actors-library-for-clojure/">Closjure</a>. Original research in actors was implemented in Lisp, so LOAF is Actors without the Lisp.  After what demand is there for Lisping Thespians?</p>
<p>The java documentation can be download from the project. I suggest starting starting your reading with the LOAF package document, which will start your introduction to Live Objects, Actors and ACID.  More on actors after I complete the posting of the series on Cloud Computing. Enjoy!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/668/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/668/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/668/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/668/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/668/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/668/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/668/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/668/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/668/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/668/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/668/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/668/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/668/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/668/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=668&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/10/10/announcement-launching-loaf-with-jam-project/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://kraftt.files.wordpress.com/2010/10/loaf-header.jpg?w=112" />
		<media:content url="http://kraftt.files.wordpress.com/2010/10/loaf-header.jpg?w=112" medium="image">
			<media:title type="html">LOAF Header</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/10/img_0361.jpg" medium="image">
			<media:title type="html">Loaf w/ Jam Poster</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud Computing: Needs a New Programming Paradigm &#8230;</title>
		<link>http://mindbeforeyoumine.com/2010/10/08/cloud-computing-needs-a-new-computing-paradigm/</link>
		<comments>http://mindbeforeyoumine.com/2010/10/08/cloud-computing-needs-a-new-computing-paradigm/#comments</comments>
		<pubDate>Sat, 09 Oct 2010 00:03:48 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[CCP]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[concurrency]]></category>
		<category><![CDATA[CSP]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Map Reduce]]></category>
		<category><![CDATA[Massive Distributed Computing]]></category>
		<category><![CDATA[Sullivan's Law]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=597</guid>
		<description><![CDATA[We have seen in a previous post how Map Reduce is analogous to how business and product managers plan and implement projects with Gantt Charts allowing a number of tasks to proceed independently in parallel and integrating their products at the end. A question from those outside of computer science might be: why has it taken this long to figure this out? In fact, if computers are so fast and capable, why don't they figure the quickest, most parallel way of executing multiple tasks all by themselves? <a href="http://mindbeforeyoumine.com/2010/10/08/cloud-computing-needs-a-new-computing-paradigm/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=597&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>That has been around a very long time.</h2>
<p><strong> </strong></p>
<p><a href="http://kraftt.files.wordpress.com/2010/10/img_0341.jpg"><img class="alignright size-full wp-image-657" title="Cloud of Webs" src="http://kraftt.files.wordpress.com/2010/10/img_0341.jpg?w=640" alt=""   /></a>We have seen in a previous post<a href="#_edn1">[i]</a> how Map Reduce is analogous to how business and product managers plan and implement projects with Gantt Charts allowing a number of tasks to proceed independently in parallel and integrating their products at the end. A question from those outside of computer science might be: why has it taken this long to figure this out?  In fact, if computers are so fast and capable, why don&#8217;t they figure the quickest, most parallel way of executing multiple tasks all by themselves?</p>
<p><span id="more-597"></span></p>
<p>The answer represents a fascinating history of personalities, ambitions, and feuds that make up what we call computer science. First, computer science is not like any other science in that it is also an engineering discipline that often gets ahead of the science, and a business segment that often gets ahead of the engineering. As a result there is no repository of long-term memory so concepts often have to be rediscovered as the necessity of new situations arise or race to catch up with the hype.  A rose by any other name would be considered something entirely different!</p>
<p>What has changed now is the general recognition that having thousands if not 10s of thousands of computers applied to a problem (such as search) can have significant advantages over a super-computer though the latter is tightly coupled, efficient, and designed to be super!</p>
<p>The web discovers Sullivan&#8217;s Law<a href="#_edn2">[ii]</a>, which was defined in the mid 70&#8242;s! At the same time, a group at MIT under Hewitt<a href="#_edn3">[iii]</a> were developing and evangelizing the concepts of CCP derived from the Actor Model of Concurrency.  Also work at Carnegie Mellon by Bennett<a href="#_edn4">[iv]</a> involved development of ACID and distributed operating systems. All this was done at the same time that Dyjkstra<a href="#_edn5">[v]</a> and Hoare<a href="#_edn6">[vi]</a> were developing CSP. At the time, there was a great deal of exchange among these groups and the citations provided above give a flavor of the exchange at the time. In the early 80s, CSP was selected as relevant to distributed system problems at that time.</p>
<p>I have had the privilege of working with Sullivan, Hewitt and Bennett at different times in my career as an engineer<a href="#_edn7">[vii]</a>. It has puzzled me why their work and contributions have not been more prevalent in our work today. Until recently one could argue the best approach was to build efficient algorithms and rely on Moore&#8217;s Law to double processing throughput every 2 years from more transistors on faster chips<a href="#_edn8">[viii]</a>. Has anyone noticed that we have been holding at around 3 GHz for 5 years? The increased capacity has come from building multiple processors called cores on a chip. But then applications have to be able exploit parallel execution at the individual computer level. Not so difficult when there a few processors, there always a loose thread around looking for a processor, but with hundreds or thousands, one needs to stop and think.</p>
<p>More recently the computational capacity increase has come from distributing processors into everything from automobiles to cell phones, and interconnecting these via the Internet<a href="#_edn9">[ix]</a>. Can there be anyway that this global network of processors can be harnessed to do useful work? To take the next leap in processing capacity requires harnessing distributed computing, and Map Reduce is just a gross (coarse grained) application of this concept.</p>
<h3><strong>Concurrency in a Distributed World</strong></h3>
<p><strong> </strong></p>
<p>In Star Trek Next Generation (I am talking with fellow nerds here, right?), we have the Borg &#8211; a collection of assimilated races attached via cyborg appliances to a central controller that is actually the composite of all the Borg minds. All can access anything experienced by one. The Borg operate without individual identity and with relentless efficiency to absorb new races. In their organization and behavior the Borg represent one vision of distributed computing<a href="#_edn10">[x]</a>.</p>
<p>We humans, enemies of the Borg that to the Borg must be absorbed because we are so inefficient, represent another form of distributed computing<a href="#_edn11">[xi]</a>. In comparison to the Borg, we seem highly inefficient, requiring a great deal of support and because we are isolated and individual, unnecessary coddling and communication to get anything done. This nicely lays out the differences and contrasts in positions (including hostilities) of the different approaches to concurrent programming.</p>
<p>There are the Borgist, who insists on tightly coupled processing and who build MIMD and SIMD processors with shared memory and clock synced instruction execution to squeeze every concurrent advantage with mechanistic efficiency. Then there are us humans that attempt to use tools such as computers as extensions of ourselves to be more effective or productive, and despite our individuality and independent behaviors seem to at be able to organize to perform amazing feats, such as sending someone to the moon, sequencing the human genome, building the Internet or manufacturing tooth picks. What is the essential difference?</p>
<p>The difference is that in the Borg model everyone shares absolute knowledge of the global state and in the human model no one at any time has absolute knowledge of the global state. In the later case everyone works with the assumption of sufficient knowledge of relevant state. This is precisely how concurrency is applied to developing distributed applications. The concept of global shared state is eliminated!</p>
<p>This represents one derivation of the concept of no shared state from CCP Theory.  An independent derivation of the same concept (I told you that concepts have to be rediscovered) arises from attempting to unlock the shared constraints in data management systems called Share Nothing Architectures in the 90s<a href="#_edn12">[xii]</a>. Yet another line of investigation comes from systems theory and economics called <em>bounded rationality<a href="#_edn13">[xiii]</a></em>. The latter makes clear that concurrent programming is clearly more a systems discipline than a programming discipline, which will become more clear as we continue.</p>
<h3><strong>Relativity in a Closed Mechanistic World View</strong></h3>
<p><strong> </strong></p>
<p>The concept of no global shared state is gut wrenching to many in computer science on a number of different levels and for a number of different reasons. There are many consequences that result from this rule. For one, the concept of global time and ordered sequence of events has to be abandoned. Just as the transition from Newtonian to Relativistic physics absolute time is eliminated, the same is true in the transition from CSP to CCP.</p>
<p>Without absolute time, events in the network cannot be sequenced in absolute order everywhere in the network (unless one waits until all data is in, which can be itself an indeterminate time). But in the meantime, decisions and other events are being generated based upon incomplete knowledge of the global state. As a result, the sequence of events that are &#8220;observed&#8221; by a process is probabilistic and not guaranteed repeatable.</p>
<p>An example is vote counting. TV networks provide exit polling data based upon sampling; followed later with raw vote precinct counts as they are reported; and followed many weeks later with the certified recorded counts. Each count is different, but the critical threshold of who got the most votes many times can be determined from the exit poll sampling and partial precinct reported counts.</p>
<p>Is absolute certainty of the actual count relevant or the fact the threshold has been passed with high certainty the important fact?  This is a question that is often asked in designing decision systems. For example, in designing a distributed operating system, is it important that all processes know exactly the storage available or whether a critical threshold has been crossed such that access must be serialized and tightly coordinated.</p>
<p>In the rare cases when results are too close that a recount is necessary such as the ones in Florida and Minnesota, one becomes exposed to the messy world of real world data with hanging chads and double entries. Ambiguity and noise in voting results is difficult to comprehend much less accept when one believes in a mechanistic closed universe where every vote is counted and the process ensures the same count every time. Whenever the ambiguities of human behavior and the verities of infrastructure are introduced, the system is never closed and the processes must take these variations into consideration. This is fundamental to how all science is performed.</p>
<p>In the real world, for 99.999% of the cases, the exit poll and raw count numbers are sufficient to determine the winner and proceed with high confidence from that result. Both in the real world and in concurrent programming one can insist upon an event being executed simultaneously everywhere; actions preformed only when events are properly sequenced to generate repeatable results; or respond to events in the shortest time possible at a given confidence level. Just not all at the same time.</p>
<h3><strong>Massive Distributed Computing in an Open Internet</strong></h3>
<p><strong> </strong></p>
<p>For massive distributed computing to work over a global open system such as the Internet, one has to be very selective on when to insist upon global understanding of time or any other state parameter if one wants to leverage concurrency to increase computational throughput. To insist on Borg-like omniscience is neither practical nor desirable.</p>
<p>The result is a computer system that is programmed much as businesses are organized. Though businesses seem highly inefficient as a computational model, they do excel at making sure that all their employees (processors) are busy, and organizing them to work productively in parallel. Small start-ups of a few workers can get more done and quicker than one individual alone. Large organizations may seem to not have linear scalability but none-the-less can perform gargantuan enterprises not conceivable for an individual.</p>
<p>There is actually something more going on here than mere programming.  The small linear programs, chubby objects, are embedded into a system that can respond to the external environment and optimize to higher-level goals. Enterprises incorporate systems and subsystems that interact at different levels of communication and coordination allowing complex processes to be distributed to simpler processes operating in parallel. The ability to adapt and optimize is a feature of self-organization that can occur when concurrent components communicate forming feedback loops.  In computer science these “programs” are referred to as <em>connectionist networks</em> that are an out growth from artificial neural networks and require developing concurrent operating environments over a wide spectrum of couplings in both communications and shared state<a href="#_edn14">[xiv]</a>.</p>
<p>All this is accomplished by not insisting upon everyone including the president having a global view of the companies state, nor serializing access to shared assets. How this would be accomplished in programming a massively distributive system requires a different programming paradigm. One that identifies not only sequences of processing steps, but also the concurrency of processes and the dependencies of these processes to the completion of other tasks, much as how businesses define work break down structures.</p>
<h3><strong>Resilience Comes To The ForeFront Over Predictability</strong></h3>
<p><strong> </strong></p>
<p>The strongest counter argument by the closed systems advocates is why develop programs that can provide only probabilistic or worst unpredictable results.  The thing is we have already developed such programs without being aware we have crossed the line into open systems programming.</p>
<p>The current rage is data driven decisions where managers make decisions based upon analytic data and business intelligence concerning their business.  The data is assumed to follow all the data integrity and consistency policies of a &#8220;global&#8221; view of the business. Companies that insist upon the highest quality data following strict accounting or for online SOX compliance place human decision makers into the process that are neither strict nor consistent. In so doing the linear programs are now part of a non-linear if not unpredictable decision process, especially if the data collected and analyzed reflects the impact of decisions made by those managers.</p>
<p>The fact that humans are making the decisions does not make the enterprise more stable. Until one considers the entire combination of human and automated subsystems as one system, one will not be able to access resilience of the enterprise to competition nor it&#8217;s potential for growth.  From that system perspective one can see if adding automated decision processes will aid or hinder the performance of the business.  At that point one has introduced feedback loops and crossed the line into open system unpredictability.</p>
<p>In place of predictability, one wants to know the resilience of the system to respond to unexpected events.  Programs designed assuming a closed system with predictable inputs do not have much resilience when connected to an open environment. Would it not be better to build systems with an understanding of its behavior in real environment rather than expect predictable results in a fantasy environment? It is not a trick question! You would be amazed on the number that insist on the later.</p>
<h2><strong>Google makes it&#8217;s Move</strong></h2>
<p><strong> </strong></p>
<p>What is this paradigm is TBD<a href="#_edn15">[xv]</a>. However at this point Python can be considered as viable a candidate as C++, C#, or Objective C or even Java or Ruby. The critical question will be how finely concurrent units can be identified and how quickly these units can be integrated into systems that perform useful work. At the moment, most are considering computational units on the scale of individual servers<a href="#_edn16">[xvi]</a>. However for Google, the computational units are Objects held as chubby chunks in the Big Table<a href="#_edn17">[xvii]</a>. So Google has already broken away from the cloud-computing pylon, huddled around conventional notions of computation.</p>
<h2><strong>What can be done that will be New?</strong></h2>
<p><strong> </strong></p>
<p>So what could be done with thousands potentially millions of servers hosting millions potentially billions of chubby worker bees just waiting to be organized to perform your every whim?  Google has the servers and the engine to support chubby worker bees and has experience in some of the ways of organizing these worker bees over a number of applications including Google Analytics, Maps, Finance, and 60 other products<a href="#_edn18">[xviii]</a>.  Is this sufficient to propel a third wave of Web change? <a href="http://wp.me/pqZC6-9P">Well Google had a thought about that &#8211; NOT!</a></p>
<h2>Works Cited</h2>
<p>Agha, G., &amp; Hewitt, C. (1985). <em>Concurrent programming using actors: Exploiting large-scale  parallelism </em>. From SpringerLink:  http://www.springerlink.com/content/y1wl7q3342006720/</p>
<p>Banerjee, U. (2009, 26-November). <em>Cloud Computing Service: Amazon EC2 vs Google GAE</em>. From Cloud  Computing Journal: http://cloudcomputing.sys-con.com/node/1200706</p>
<p>Bhatia, A. (2007, 10-December). <em>Shared Nothing Architecture</em>. From Toolbox.com:  http://it.toolbox.com/wiki/index.php/Shared_Nothing_Architecture</p>
<p>Boyer, R., Chen, D., Gray, A., &amp; Lee, T. (1999,  30-November). <em>Operating Systems for the  WWW</em>. From Google Docs:  http://www.bennetyee.org/ucsd-pages/Courses/cse221.f99/OSSurveyF99/papers/boyer%2Cchen%2Cgray%2Clee.operating_systems_for_the_www.pdf</p>
<p>Business Dictionary. (n.d.). <em>Bounded Rationality</em>. From Business Dictionary:  http://www.businessdictionary.com/definition/bounded-rationality.html</p>
<p>Closson, K. (2007, 9-August). <em>Nearly Free or Not, GridSQL for EnterpriseDB is Simply Better Than  Real Application Clusters. It is Shared-Nothing Architecture After All! </em>.  From Kevin Closson’s Oracle Blog:  http://kevinclosson.wordpress.com/2007/08/09/nearly-free-or-not-gridsql-for-enterprisedb-is-simply-better-than-real-application-clusters-it-is-shared-nothing-architecture-after-all/</p>
<p>Dijkstra, E. W. (1975). <em>Guarded  commands, nondeterminacy and formal derivation of programs</em>. From ACM:  http://doi.acm.org/10.1145/360933.360975</p>
<p>Fay Chang, J. D. (2006, November). <em>Bigtable: A Distributed Storage System for Structured Data</em>. From  Google Research Publications: http://labs.google.com/papers/bigtable.html</p>
<p>Hewitt, C., &amp; Baker, H. (1977, 10-May). <em>Laws for Communicating Parallel Processes</em>.  From MIT: http://mit.dspace.org/bitstream/handle/1721.1/41962/AI_WP_134A.pdf?sequence=1</p>
<p>Hoare, C. (1978, August). <em>Communicating sequential processes</em>. From ACM:  http://portal.acm.org/citation.cfm?doid=359576.359585</p>
<p>Knorr, E., &amp; Gruman, G. (2009, 5-May). <em>What cloud computing really means</em>. From  Info World, Cloud Computing:  http://www.infoworld.com/d/cloud-computing/what-cloud-computing-really-means-031</p>
<p>Kraft, T. (2010, 02-July). <em>Clouds of Hadoop: How Map Reduce Changed the World </em>. From Mind  Before You Mine: http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/</p>
<p>Kraft, T. (2010, 25-June). <em>Send In the Clouds: Old Metaphor Gets New Life</em>. From Mind Before  You Mine: http://mindbeforeyoumine.com/2010/06/25/send-in-the-clouds/</p>
<p>Kraft, T. T. (1989). Summary Proposal for the Development of  an Architecture Neutral Distribution Format based upon Communicating  Concurrent Processes. <em>Neurocomputing</em> <em>, 1(2)</em>, 23-28.</p>
<p>Meadows, D. H. (2008). <em>Thinking  in Systems &#8211; A Primer &#8211; .</em> (D. Wright, Ed.) River Junction, Vermont:  Chelsea Green Publishing.</p>
<p>Moore, G. (2010). <em>Moore&#8217;s  Law: Made real by Intel innovation</em>. From Intel:  http://www.intel.com/technology/mooreslaw/</p>
<p>Paolo Fogliata, M. T. (2008, 18-August). <em>Intelligence-ready network infrastructure:  An ecosystem to control third-party intelligence distribution close to nomadic  users</em>. From Wiley Online Library:  http://onlinelibrary.wiley.com/doi/10.1002/bltj.20306/abstract</p>
<p>Patton, P. C., &amp; Hersh, C. I. (2003, April). <em>Asynchronous Non-Preemptive Tasking for  Parallel and Distributed Systems</em>. From A Quantitative Methods/Computer  Science White Paper: http://140.209.112.90/WhitePapers/QMCS/ANT.html</p>
<p>Ricadela, A. (2007, 16-November). <em>Computing Heads for the Clouds</em>. From Bloomberg Businessweek:  http://www.businessweek.com/technology/content/nov2007/tc20071116_379585.htm</p>
<p>Sullivan, H., &amp; Bashkow, T. R. (1977). <em>A large scale, homogeneous, fully  distributed parallel machine, I</em>. From ACM:  http://doi.acm.org/10.1145/800255.810659</p>
<p>Swoyer, S. (2008, 08-July). <em>Is Cloud Computing the Next Disruptive Technology?</em> From ESJ:  http://esj.com/articles/2008/07/08/is-cloud-computing-the-next-disruptive-technology.aspx</p>
<div>
<hr size="1" />
<div>
<p><a href="#_ednref">[i]</a> <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">(Kraft T. , 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[ii]</a> <a href="http://doi.acm.org/10.1145/800255.810659">(Sullivan &amp; Bashkow, 1977)</a></p>
</div>
<div>
<p><a href="#_ednref">[iii]</a> <a href="http://mit.dspace.org/bitstream/handle/1721.1/41962/AI_WP_134A.pdf?sequence=1">(Hewitt &amp; Baker, 1977)</a></p>
</div>
<div>
<p><a href="#_ednref">[iv]</a> John K. Bennett is citied in many papers for his work in distributed operating systems and is currently a professor at University of Colorado. An example paper summarizing various operating systems for the web includes a discussion of Prof. Bennett’s contribution. <a href="http://www.bennetyee.org/ucsd-pages/Courses/cse221.f99/OSSurveyF99/papers/boyer%2Cchen%2Cgray%2Clee.operating_systems_for_the_www.pdf">(Boyer, Chen, Gray, &amp; Lee, 1999)</a></p>
</div>
<div>
<p><a href="#_ednref">[v]</a> <a href="http://doi.acm.org/10.1145/360933.360975">(Dijkstra, 1975)</a></p>
</div>
<div>
<p><a href="#_ednref">[vi]</a><a href="http://portal.acm.org/citation.cfm?doid=359576.359585"> (Hoare, 1978)</a></p>
</div>
<div>
<p><a href="#_ednref">[vii]</a> Bennett was my first introduction to distributed computing when I worked on a joint DARPA project on distributed operating systems for the Navy.  I had been following Hewitt’s work ever since I learned of Actor’s from an old Byte magazine article in 1977.  At the time Actors and Objects ala Smalltalk were practically inseparable concepts because of the emphasis on communications (which has been carried over to Objective C). In the late 80’s I began seriously looking at Actor concurrency in developing Neural Network or Connectionist architectures and met Professor Hewitt at a neural network conference.  Apparently my advocating Actor’s peeked his interest in neural networks.  Later after this experience I met with Dr. Sullivan when I was brought in to consult with one of his companies.  At the time I was very much an efficiency wonk and at the same time trying to relate Actors to Ants.  For a couple days, Dr. Sullivan took me out to the “wood shed” for an attitude adjustment, which resulted in extensive notes on concurrency and the theory of computation.  All the time, he was chain smoking through the entire set of lectures.  After being rightly adjusted, I was able to combine what I learned from both Hewitt and Sullivan to develop a fully N-scalable app server based upon actor concurrency for processing web analytics data in one of the first Web Analytics as a Service offerings.</p>
</div>
<div>
<p><a href="#_ednref">[viii]</a> <a href="http://www.intel.com/technology/mooreslaw/">(Moore, 2010)</a> This is a great reference to Moore’s Law linking to important papers and history. As a cofounder of Intel, it can be viewed as Intel’s mission statement for the last 50 years.</p>
</div>
<div>
<p><a href="#_ednref">[ix]</a> <a href="http://onlinelibrary.wiley.com/doi/10.1002/bltj.20306/abstract">(Paolo Fogliata, 2008)</a></p>
</div>
<div>
<p><a href="#_ednref">[x]</a> Borg distributed computing model – tightly coupled synchronous processors with shared memory with parallel execution at the instruction level either Same Instruction Multiple Data (SIMD) or Multiple Instructions Multiple Data (MIMD).</p>
</div>
<div>
<p><a href="#_ednref">[xi]</a> Human distributed computing model – loosely coupled asynchronous distributed.</p>
</div>
<div>
<p><a href="#_ednref">[xii]</a> <a href="http://kevinclosson.wordpress.com/2007/08/09/nearly-free-or-not-gridsql-for-enterprisedb-is-simply-better-than-real-application-clusters-it-is-shared-nothing-architecture-after-all/">(Closson, 2007)</a></p>
</div>
<div>
<p><a href="#_ednref">[xiii]</a> <a href="http://www.businessdictionary.com/definition/bounded-rationality.html">(Business Dictionary)</a> also for context of the concept in System’s Theory see <a href="http://www.amazon.com/Thinking-Systems-Donella-H-Meadows/dp/1603580557/ref=sr_1_1?s=books&amp;ie=UTF8&amp;qid=1286581593&amp;sr=1-1">(Meadows, 2008)</a></p>
</div>
<div>
<p><a href="#_ednref">[xiv]</a> (Kraft T. T., 1989)</p>
</div>
<div>
<p><a href="#_ednref">[xv]</a> With <a href="http://wp.me/pqZC6-aM">LOAF with JAM</a> I cover how adding actor language extensions to Java and other languages can meet all the requirements of a massive distributed computing paradigm. See also <a href="http://www.springerlink.com/content/y1wl7q3342006720/">(Agha &amp; Hewitt, 1985)</a></p>
</div>
<div>
<p><a href="#_ednref">[xvi]</a> <a href="http://www.infoworld.com/d/cloud-computing/what-cloud-computing-really-means-031">(Knorr &amp; Gruman, 2009)</a></p>
</div>
<div>
<p><a href="#_ednref">[xvii]</a> <a href="http://www.infoworld.com/d/cloud-computing/what-cloud-computing-really-means-031">(Knorr &amp; Gruman, 2009)</a> also <a href="http://cloudcomputing.sys-con.com/node/1200706">(Banerjee, 2009)</a> and <a href="http://esj.com/articles/2008/07/08/is-cloud-computing-the-next-disruptive-technology.aspx">(Swoyer, 2008)</a> for potential for disruption.</p>
</div>
<div>
<p><a href="#_ednref">[xviii]</a> <a href="http://labs.google.com/papers/bigtable.html">(Fay Chang, 2006)</a></p>
</div>
</div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/597/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/597/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/597/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/597/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/597/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/597/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/597/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/597/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/597/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/597/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/597/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/597/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/597/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/597/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=597&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/10/08/cloud-computing-needs-a-new-computing-paradigm/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" />
		<media:content url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" medium="image">
			<media:title type="html">HAL Cloud</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/10/img_0341.jpg" medium="image">
			<media:title type="html">Cloud of Webs</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud Computing From Upside Down</title>
		<link>http://mindbeforeyoumine.com/2010/10/08/cloud-computing-from-upside-down/</link>
		<comments>http://mindbeforeyoumine.com/2010/10/08/cloud-computing-from-upside-down/#comments</comments>
		<pubDate>Fri, 08 Oct 2010 21:42:24 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Big Table]]></category>
		<category><![CDATA[CCP]]></category>
		<category><![CDATA[Chubby Locks]]></category>
		<category><![CDATA[CSP]]></category>
		<category><![CDATA[GFS]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Wave]]></category>
		<category><![CDATA[Massive Distributed Computing]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=595</guid>
		<description><![CDATA[Google continues to snub their noses to conventional wisdom and build out their cloud their way. What is going on here?  If anything history has taught us - don't ignore the big brain in the corner. So we continue our irreverent look at Google as the Brain's nefarious plan to take over the world continues. <a href="http://mindbeforeyoumine.com/2010/10/08/cloud-computing-from-upside-down/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=595&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2><strong>Google: Pinky and the Brain</strong></h2>
<h3><strong>World Domination Plan 137:<br />
Phase 3: Distract before Conquering</strong></h3>
<p><strong> </strong></p>
<p><img class=" alignleft" title="Pinky and the Brain World Conquest" src="http://www.mycartoonnetwork.com/attachments/Image/pinky-and-the-brain.jpg" alt="" width="300" height="300" /></p>
<p>While everyone is building out their Hadoop farms<a href="#_edn1">[i]</a> and adopting Map Reduce to establish elastic computing<a href="#_edn2">[ii]</a>, Google is busy building (and patenting) some strange new technologies with names from a Dick Tracy line up &#8211; Gears, Big Table and Chubby Locks. Also it is busily building the biggest mash up heretofore envisioned called Google Wave allowing individuals across the web to message and collaborate in real time &#8230; at a keystroke level<a href="#_edn3">[iii]</a>!</p>
<p><span id="more-595"></span></p>
<p>With Map Reduce, Google was able to process the entire content of the web<a href="#_edn4">[iv]</a>. However, map reduce is a batch technology that gains concurrency advantage by mapping resources to perform dedicated parallel tasks on batches of data<a href="#_edn5">[v]</a>. In parallel computing parlance one might call this Same Task Multiple Data (STMD), where not only is the same task performed on multiple computers, but within a computer the task processes multiple instances (rows) of data in batch. The dual architecture of Multiple Tasks Same Data (MTSD) is also conceivable with map reduce but a more rare parallel computing strategy<a href="#_edn6">[vi]</a>.</p>
<p>So what does it take to move beyond this batch non-real-time limitation and be able to process any task, any time, anywhere at massive demand scale? The answer is Massive Distributive Computing (MDC). This requires the transition from Communicating Sequential Processes (CSP)<a href="#_edn7">[vii]</a> to Communicating Concurrent Processes (CCP)<a href="#_edn8">[viii]</a>.  Don&#8217;t let the big words bother you. They will be all explained in a moment, but first what is concurrency.</p>
<h3><strong>Sequence in a Concurrent World</strong></h3>
<p><strong> </strong></p>
<p>Remember in a previous post<a href="#_edn9">[ix]</a> we likened demands on storage to that of a cash register in a super market. By separating storage into the equivalent of aisles, we can get concurrent processing advantage to reduce queues of customers, the analog to processes accessing storage. The assumption is that any of the cashiers with their registers can service any customer. Once the customer is at the register, performing checkout, the cashier should complete the checkout, without interruption, before moving on to the next customer.</p>
<p>The technical term is that checkout is an <em>atomic transaction<a href="#_edn10">[x]</a></em>. If the checkout was interrupted so that the cashier had to checkout a more important customer (called <em>pre-emption</em>), besides angering the pre-empted customer, the cashier must rollback to the state prior to checkout, as though the customer had never started the process (<em>rollback on pre-emption</em>). So when the peeved customer leaves the store in a tizzy, neither he nor the store has lost any money in the transaction. A similar thing would happen if at the end, the customer discovered she had no money to complete the transaction (<em>rollback on exception</em>).</p>
<p>Though pre-emption and rollback are relatively rare in a supermarket, these are frequent occurrences in processing data where there is large number of processes sharing resources such as storage or state information such as current storage capacity. To handle these cases, each process must receive a lock, similar to taking a number at the deli to get service<a href="#_edn11">[xi]</a>. The customer is free to do something else but usually just waits to be called.</p>
<h3><strong>Waiting on the Internet Highway</strong></h3>
<p><strong> </strong></p>
<p>So our first clue that Google is moving to massive distributive computing is Chubby Access Locks<a href="#_edn12">[xii]</a>. Chubby is a mechanism that allows a process anywhere in the world to obtain a lock (take a number) on a resource anywhere else in the network<a href="#_edn13">[xiii]</a>.</p>
<p>In a typical database management system the shared data elements can be very fine grained (down to individual data elements).  For a distributed system, the overhead of obtaining locks on individual elements is not practical, so blocks of data are more coarse grained or chunkier, hence locks for sequencing access to chubby chunks of data.</p>
<p>Not the obese chunks represented in a batch file processed by Map Reduce, but pleasantly zaftig such as perhaps an individual row of data in a bulk file.  Furthermore we don’t have to wait for an accumulation of chubby chunks to make up one obese batch to get concurrency advantage in increase throughput. We can process the data chunks as they arrive &#8211; asynchronously!</p>
<p>The discussion so far is typical of the concerns covered under Communicating Sequential Processes (CSP). The communication comes from the mechanism the notifies a process when it can safely proceed in it&#8217;s access of a shared resource and sequential (or serialized) access to the resource is the consequence of the communication. This also defines the communications between processes. The process <em>rendezvous</em> with the resource process and communicate back and forth (called a <em>protocol</em>) until the initiating process is done with the resource and relinquishes the lock.</p>
<p>Though almost all books on concurrent programming begin with CSP, the result is not parallel execution but serial execution of tasks that share a common resource. As we found out, sequential processing reduces the concurrency advantage over performing processes in parallel without locks. To reduce sequencing and increase concurrent leverage and likewise increase throughput performance, one needs to reduce the need for locks as well as the occurrences of pre-emption and rollback. This is covered in the discipline of Communicating Concurrent Processes (CCP) that is based upon a different model of concurrency<a href="#_edn14">[xiv]</a>.</p>
<h3><strong>Rest Stops on the Internet Highway</strong></h3>
<p><strong> </strong></p>
<p>Relational database systems such as Oracle or MySQL are archetypes of CSP systems where even with grid implementations they can quickly extinguish any concurrency advantage. These implement the Relational Model for data independence<a href="#_edn15">[xv]</a>. Their objective is to break chunks of data into it&#8217;s basic elements such that a value is stored in one place only and referenced in all other places through unique identifiers called reference keys or indices.  In this way a change in a data value propagates consistently to all references of that value.</p>
<p>From a concurrent processing view this is analogous to running a taxi cab company, where every time a taxi comes in, it is broken down into it&#8217;s component parts and stored in racks (tables). Then when a cab is sent out, it must be reassembled from the parts available in the racks. In most cases, this is not an attractive business model<a href="#_edn16">[xvi]</a>. It is tempting to apply this model to software or data, where for example, newer parts can be swapped in or same parts used for different purposes (constructing more buses than taxis) to meet demand. But even in these cases it not appropriate to insist upon the objects (the chunks of data constructed from data elements) to be broken down into parts for storage each time. Especially if the object is just resting in cache, ready to be called upon soon to perform more work. Another strategy has to be considered.</p>
<p><img class="alignright" title="The World for the Taking" src="http://www.eatmedaily.com/wordpress/wp-content/uploads/2010/03/Pinky-And-The-Brain.jpg" alt="" width="200" height="191" />So the second clue that Google has developed an MDC system is Big Table<a href="#_edn17">[xvii]</a>. Big Table deals with chubby chunks of structured data that can be anything (but typically represents an object state) held in a flat table distributed similar to files in GFS<a href="#_edn18">[xviii]</a>.  It is a concurrency enabler in CCP<a href="#_edn19">[xix]</a>. At least that is the potential. To realize the potential requires an entirely different method of defining and developing apps. <a href="http://wp.me/pqZC6-9D">The company that gets this right rules the world!</a></p>
<h2>Works Cited</h2>
<p>Agha, G., &amp; Hewitt, C. (1985). <em>Concurrent programming using actors: Exploiting large-scale  parallelism </em>. From SpringerLink:  http://www.springerlink.com/content/y1wl7q3342006720/</p>
<p><em>Announcing  Amazon Elastic MapReduce </em>. (2009, 2-April). From Amazon News: http://aws.amazon.com/about-aws/whats-new/2009/04/02/announcing-amazon-elastic-mapreduce/</p>
<p>Burrows, M. (2006, 7-November). <em>The Chubby Lock Service for Loosely-Coupled Distributed Systems</em>.  From Google Research Publications: http://labs.google.com/papers/chubby.html</p>
<p>Collmeyer, A. J. (1972). <em>Implications  of data independence on the architecture of database management systems</em>.  From ACM: http://doi.acm.org/10.1145/800295.811495</p>
<p>Dean, J., &amp; Ghemawat, S. (2004, 15-December). <em>MapReduce: Simplified Data Processing on  Large Clusters</em>. From Google Research Publications:  http://labs.google.com/papers/mapreduce.html</p>
<p>Fay Chang, J. D. (2006, November). <em>Bigtable: A Distributed Storage System for Structured Data</em>. From  Google Research Publications: http://labs.google.com/papers/bigtable.html</p>
<p>Hewitt, C., &amp; Baker, H. (1977, 10-May). <em>Laws for Communicating Parallel Processes</em>.  From MIT:  http://mit.dspace.org/bitstream/handle/1721.1/41962/AI_WP_134A.pdf?sequence=1</p>
<p>Hoare, C. (1978, August). <em>Communicating sequential processes</em>.  From ACM: http://portal.acm.org/citation.cfm?doid=359576.359585</p>
<p>Kraft, T. (2010, 02-July). <em>Clouds of Hadoop: How Map Reduce Changed the World </em>. From Mind  Before You Mine: http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/</p>
<p>Kraft, T. (2010, 25-June). <em>Send In the Clouds: Old Metaphor Gets New Life</em>. From Mind Before  You Mine: http://mindbeforeyoumine.com/2010/06/25/send-in-the-clouds/</p>
<p>Sullivan, H., &amp; Bashkow, T. R. (1977).  <em>A large scale, homogeneous, fully  distributed parallel machine, I</em>. From ACM:  http://doi.acm.org/10.1145/800255.810659</p>
<p>Wikipedia. (n.d.). <em>Paxos  Algorithm</em>. From Wikipedia: http://en.wikipedia.org/wiki/Paxos_algorithm</p>
<p><em>Yahoo!  Advances Hadoop From Science to the World&#8217;s Largest Internet Deployment to  Mainstream </em>. (2010, 29-June). From Yahoo! News:  http://labs.yahoo.com/news/426</p>
<div>
<hr size="1" />
<div>
<p><a href="#_ednref">[i]</a> <a href="http://labs.yahoo.com/news/426">(Yahoo! Advances Hadoop From Science to the World&#8217;s  Largest Internet Deployment to Mainstream , 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[ii]</a> <a href="http://aws.amazon.com/about-aws/whats-new/2009/04/02/announcing-amazon-elastic-mapreduce/">(Announcing Amazon Elastic MapReduce , 2009)</a></p>
</div>
<div>
<p><a href="#_ednref">[iii]</a> Yes, I know that Google has recently shelved the project.  Like Pinky and the Brain, not all plans are successful, and this case the reason for failure is as illustrative and nuanced as a PatB episode. This will be discussed more later.</p>
</div>
<div>
<p><a href="#_ednref">[iv]</a> <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">(Kraft T. , 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[v]</a> <a href="http://labs.google.com/papers/mapreduce.html">(Dean &amp; Ghemawat, 2004)</a></p>
</div>
<div>
<p><a href="#_ednref">[vi]</a> Besides this leads down the slippery slope to Multiple Tasks Multiple Data (MTSD) or even worst Different Tasks Different Data (DTDD), which is beyond the capabilities of MR leading to Massive Distributed Computing (MDC).</p>
</div>
<div>
<p><a href="#_ednref">[vii]</a> <a href="http://portal.acm.org/citation.cfm?doid=359576.359585">(Hoare, 1978)</a></p>
</div>
<div>
<p><a href="#_ednref">[viii]</a> <a href="http://mit.dspace.org/bitstream/handle/1721.1/41962/AI_WP_134A.pdf?sequence=1">(Hewitt &amp; Baker, 1977)</a></p>
</div>
<div>
<p><a href="#_ednref">[ix]</a> <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">(Kraft T. , 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[x]</a> The transaction completes without interruption or roles back to the initial state. There are no intermediate states allowed, hence the transition is elemental or atomic.</p>
</div>
<div>
<p><a href="#_ednref">[xi]</a> There are several schemes for coordinating access to a shared object (mutex, monitors, objects), but they all require sequencing the access to the shared resource and holding a key or token while accessing the resource which must be passed back at the end of the transaction.</p>
</div>
<div>
<p><a href="#_ednref">[xii]</a> <a href="http://labs.google.com/papers/chubby.html">(Burrows, 2006)</a></p>
</div>
<div>
<p><a href="#_ednref">[xiii]</a> Chubby from the paper seems to be confined primarily to a cluster of approximately 10,000 servers, but these clusters can communicate within and across centers, theoretically Chubby as a Name Service can communicate outside of the clusters.</p>
</div>
<div>
<p><a href="#_ednref">[xiv]</a> <a href="http://www.springerlink.com/content/y1wl7q3342006720/">(Agha &amp; Hewitt, 1985)</a></p>
</div>
<div>
<p><a href="#_ednref">[xv]</a> <a href="http://doi.acm.org/10.1145/800295.811495">(Collmeyer, 1972)</a></p>
</div>
<div>
<p><a href="#_ednref">[xvi]</a> Dr. Sullivan (1991) private communications with author</p>
</div>
<div>
<p><a href="#_ednref">[xvii]</a> <a href="http://labs.google.com/papers/bigtable.html">(Fay Chang, 2006)</a></p>
</div>
<div>
<p><a href="#_ednref">[xviii]</a> The Google term for breaking up a monolithic database into multiple pieces is called <em>sharding</em>, where each piece distributed in the file system is called a <em>shard</em>.</p>
</div>
<div>
<p><a href="#_ednref">[xix]</a> Sharding allows data base transactions to be executed in parallel on different shards at the same time. So part of an efficient parallel data base access strategy is to attempt to reduce multiple hits on the same shard during a query.</p>
</div>
</div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/595/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/595/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/595/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/595/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/595/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/595/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/595/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/595/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/595/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/595/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/595/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/595/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/595/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/595/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=595&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/10/08/cloud-computing-from-upside-down/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" />
		<media:content url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" medium="image">
			<media:title type="html">HAL Cloud</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://www.mycartoonnetwork.com/attachments/Image/pinky-and-the-brain.jpg" medium="image">
			<media:title type="html">Pinky and the Brain World Conquest</media:title>
		</media:content>

		<media:content url="http://www.eatmedaily.com/wordpress/wp-content/uploads/2010/03/Pinky-And-The-Brain.jpg" medium="image">
			<media:title type="html">The World for the Taking</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud Computing: 3 Reasons Why Analytics Should Care</title>
		<link>http://mindbeforeyoumine.com/2010/10/07/cloud-computing-3-reasons-why-analytics-should-care/</link>
		<comments>http://mindbeforeyoumine.com/2010/10/07/cloud-computing-3-reasons-why-analytics-should-care/#comments</comments>
		<pubDate>Fri, 08 Oct 2010 03:17:40 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Cloud Hosting]]></category>
		<category><![CDATA[Glen Beck]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Google Wave]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Interwoven]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[Justin Bieber]]></category>
		<category><![CDATA[Larry Ellison]]></category>
		<category><![CDATA[Map Reduce]]></category>
		<category><![CDATA[Mark Hurd]]></category>
		<category><![CDATA[Marshall McLuhan]]></category>
		<category><![CDATA[Massive Distributed Computing]]></category>
		<category><![CDATA[Tealeaf]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[Web Analytics as a Service]]></category>

		<guid isPermaLink="false">http://mindbeforeyoumine.com/?p=618</guid>
		<description><![CDATA[From my blog analytics, I can actually see the eyes rolling as my audience quickly skirts my Cloud Computing posts to go to the old stuff (back when I was funny). In some ways I understand the ambivalence but I will give you three reasons to reconsider that ambivalence. As you may be aware, I am not big with posts that give lists of simple answers that can be quickly absorbed and just as easily forgotten.  In this case will make exception and try not to spin the topic into simplistic babble. <a href="http://mindbeforeyoumine.com/2010/10/07/cloud-computing-3-reasons-why-analytics-should-care/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=618&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>I Can See Your Eyes Rolling</h2>
<p><a href="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg"><img class="alignright size-medium wp-image-589" title="HAL Cloud" src="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=225&#038;h=300" alt="" width="225" height="300" /></a>From my blog analytics, I can actually see the eyes rolling as my audience quickly skirts my Cloud Computing posts to go to the old stuff (back when I was funny).  In some ways I understand the ambivalence. I perhaps like you are not overtaken by awe with virtualization and utility computing that defines cloud computing today.  I would call this more <em>cloud hosting</em> than computing. My urge is to go beyond this in an attempt to grasp the computing part of the cloud. Understand what new capabilities may be facilitated and computations performed.  So why should you care?</p>
<p><span id="more-618"></span></p>
<p>I know.  Why be concerned with the future when we barely understand the present (from an analytics perspective).  What happens to our data as more customers go mobile?  How are we suppose to deal with social media and mashup tools such as Kickin or Cooliris or Yoono, which was the topic of my first blog post <a href="http://mindbeforeyoumine.com/2010/03/12/how-web-analytics-sees-reality/">“How Web Analytics Sees Reality”</a>?   How do we combine brick and mortar marketing with on-line marketing?  Is it all new or the old repackaged with a new name?   Is Viral Marketing just another name for publicity stunts?</p>
<p>My natural instinct when I seem to be bobbing up and down like a cork on an ocean in a hurricane of change is to attempt to see the larger picture in play.  What is turbulence (noise) and what is a meaningful trend?  I found out recently (from reading Donella Meadows’ <a href="http://www.amazon.com/Thinking-Systems-Donella-H-Meadows/dp/1603580557/ref=sr_1_1?s=books&amp;ie=UTF8&amp;qid=1286497497&amp;sr=1-1">“Thinking in Systems – A Primer -”</a>) that this is an aspect of “Thinking in Systems”, looking at events as part of larger system processes with longer time-lines.  As we move from reactive analytics (what happened and why?) to predictive analytics (what will happen and how much?), this is an instinct and discipline that should be nurtured by all in analytics.</p>
<p>But returning to Cloud Computing and why we as analysts should care and have some understanding, I will give three reasons.  As you may be aware, I am not big with posts that give lists of simple answers that can be quickly absorbed and just as easily forgotten.  In this case will make exception and try not to spin the topic into simplistic babble.</p>
<h3>1. IT is Part of Our Job Description</h3>
<p>Someday if not already you will find yourself across the table with someone in IT (at Yahoo! the particularly stubborn ones were aptly called Paranoids) whose job description is to maintain the <em>status quo</em> while your objective will be to change the <em>quo</em>.  He will have no reason or motivation to understand what you are trying to do, so it will be up to you be the bridge from your problem into what IT is concerned with.</p>
<p>You will not necessarily have to write the code or even design the solution.  But you will need to cross the bridge and present options that IT would be interested in understanding and following up with action.   Cloud Computing and its concern with scalability, availability along with elasticity and utility presents a lot of options that must be understood by all parties.  This includes us data wranglers, since data movement and processing are significant aspects effecting these options.  Leading us to reason two:</p>
<h3>2. Analytics is typically the First User of Cloud Computing</h3>
<p>Some might argue pornography as the first user but all the clicks to all those sites had to be counted by a much larger entity. Web analytics has always had to scale larger than its largest customers.  EBay is only a portion of Omniture’s traffic.  They don’t have to serve all the content but they do need to record all the transactions.</p>
<p>Having started in this business at the beginning of the switch to Web Analytics as a Service, I have had first hand experience in meeting collection and processing demand. This meant implementing map reduce before the term was even invented.  In my case, we developed appservers based upon the actor model of concurrency – Map Reduce on steroids.  It was the same struggle for all the other analytic companies.  If you want to go really deep, I have published the documentation for the system I developed called <a href="http://wp.me/pqZC6-aM">LOAF with JAM</a>.</p>
<p>Even the large companies I worked for (Yahoo, Overture, EBay), processing and warehousing data is a central aspect of their business model.  This is one of the reasons that Yahoo! has gone <a href="http://labs.yahoo.com/news/426">“big” </a>on Hadoop server farms having lashed together some of the largest outside of Google.</p>
<p>Even Google Analytics takes a small niche analytics company and hosts them in Google’s cloud to become the commodity provider of web analytics to most of the world.  This has not abated! I recently meet with a startup with relatively few customers (couple 100) but yet has collected several petabytes of data that must be mined and reported. All these are possible through the evolving discipline of distributed (i.e. cloud) computing.</p>
<p>Now most of us can have a good career simply trying to get IT to put Web Analytic Provider tags on our sites.  At some point we may be wildly successful and actually get the business to look at the data and base decisions upon the insights derived from that data.  At which time you will be cubby holed with IT to get new capabilities on line.</p>
<p>At some further point, the business is going to out grow the standard offerings and attempt to develop in-house analytic capabilities and tools.  This most likely will also become the business’s introduction to cloud computing platforms such as Hadoop or data warehouses such as Teradata or Interwoven or Customer Experience Management such as Tealeaf.  As the web analytics person you will be in the middle of this, maybe even the cause of it.</p>
<h3>3. Cloud Computing is the Reality We Are Attempting to Measure</h3>
<p>In McLuhan-esque terms – <a href="http://mindbeforeyoumine.com/2010/07/12/mcluhans-tetrad-applied-to-internet/">Cloud Computing is the Medium</a> and as we all know, <a href="http://individual.utoronto.ca/markfederman/article_mediumisthemessage.htm">the Medium is the Message</a>. Who knew?  You might have thought that Google Search was the message, or Facebook or Twitter or Foursquare or Wrrl or maybe Farmville.  These are but ephemeral manifestations enabled by the underlying medium.  Ultimately it is the medium that changes us and not specific babbling that crosses the medium.</p>
<p>The message is not Justin Bieber but the fact that an entity that happens to be Justin Bieber can consume <a href="http://mashable.com/2010/09/07/justin-bieber-twitter/">3% of all Twitter’s infrastructure</a> and that with all the freedoms that we can exercise, we as a mass can coalesce around an entity such as Justin Bieber.</p>
<blockquote><p>Justin Bieber is the Britney Spears of Twitter, who was the Madonna of Search, who was the Cher of MTV, who was a hippie.</p></blockquote>
<p>The cloud has manifested itself in Web 2.0 first as search and then as social networking and micro-blogging.  It is now mobile and in the future will become something else. It will be advancements in computing more and faster that will drive the medium to other manifestations in Web 3.0.</p>
<p>Is Cloud Hosting sufficient to change our reality by enabling all the X as a Service providers to finally have a viable business model or does the Cloud have to actually do work?  It appears that an entire economy is growing around apps empowered by the cloud. What is the potential of these AppStores harnessing cloud computing to shape our world? These are questions I am trying to understand.</p>
<p>The reason why I believe it is important to understand is that our first duty in analytics is to understand the reality we are trying to measure and then determine how we track and measure that reality.  Advancements in cloud computing will be the enabler of all the realities we will be tracking in the future.  Perhaps this is a more fundamental perspective than most would want to take, but I hope that my contributions will reduce the effort for my readers.</p>
<h2>What is Coming!</h2>
<p>It has been an interesting journey, which started in mid summer.  I thought I saw a narrative arc for researching and organizing the material for understanding cloud computing, but then events intervened and converted it into more of a corkscrew.</p>
<p>It started with <a href="http://googleblog.blogspot.com/2010/08/update-on-google-wave.html">Google’s dropping Wave</a> in August, which to me was the most ambiguous mash-up to date.  It illustrated what could be accomplished with Google&#8217;s massive computing power. Why it failed is a proverb of our times, cautioning us on our punditry about the future.  More recently is Larry Ellison’s brouhaha with <a href="http://www.wired.com/epicenter/2010/09/former-hp-ceo-mark-hurd-lands-on-his-feet-at-oracle/">HP over Mark Hurd</a> and Oracle’s announcement of their <a href="http://searchoracle.techtarget.com/news/2240022616/Oracle-Exalogic-is-the-companys-cloud-in-a-box">“Cloud in a Box”</a>.  Don’t want to go all Glen Beck but all these events are related.</p>
<p>The result is a white-paper that consists of over 70 references and 40 citations that covers the conventional view of cloud computing [<a title="Cloud Computing from Both Sides Now" href="http://mindbeforeyoumine.com/2010/07/19/cloud-computing-from-both-sides-now/">here</a>] and what Google is doing that goes beyond conventional approaches [<a title="Cloud Computing from Upside Down" href="http://mindbeforeyoumine.com/2010/10/08/cloud-computing-from-upside-down/">here</a>].  I will admit that I do have an agenda or primary thesis: that cloud computing is about both virtual hosting AND organizing these virtual computation units into massive distributed computing networks.  In other words, cloud computing is about computing not merely hosting.</p>
<p>So there is a post that covers Massive Distributed Computing and its features that change radically how we approach computing in the future [<a title="Cloud Computing: Needs a New Computing Paradigm ..." href="http://mindbeforeyoumine.com/2010/10/08/cloud-computing-needs-a-new-computing-paradigm/">here</a>]. Finally we take this new found perspective for a test drive looking at the surprising failure of <a href="http://mindbeforeyoumine.com/2010/10/11/breaking-oops-google-wave-a-fail/">Google Wave</a> and Oracles assertion of a <a href="http://mindbeforeyoumine.com/2010/10/12/oracle-announces-a-cloud-in-a-box/">Cloud in a Box</a>.</p>
<p>You may also want to take a look at those post that you  might have avoided &#8211; <a href="http://mindbeforeyoumine.com/2010/06/25/send-in-the-clouds/">&#8220;Send in the Clouds&#8221;</a> and <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">&#8220;Clouds of Hadoop&#8221;</a>.</p>
<p>If anyone would like a copy of the complete white paper in Word format, please email <a href="timkraf@yahoo.com">me</a> and I will provide a link with no further hassle or obligations.  Otherwise, I hope you enjoy the series and find some value in the references and narrative.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/618/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/618/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/618/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/618/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/618/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/618/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/618/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/618/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/618/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/618/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/618/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/618/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/618/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/618/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=618&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/10/07/cloud-computing-3-reasons-why-analytics-should-care/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=225" medium="image">
			<media:title type="html">HAL Cloud</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud Computing From Both Sides Now</title>
		<link>http://mindbeforeyoumine.com/2010/07/19/cloud-computing-from-both-sides-now/</link>
		<comments>http://mindbeforeyoumine.com/2010/07/19/cloud-computing-from-both-sides-now/#comments</comments>
		<pubDate>Mon, 19 Jul 2010 23:10:50 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Availability]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Elastic Computing]]></category>
		<category><![CDATA[Scalability]]></category>
		<category><![CDATA[Sullivan's Law]]></category>
		<category><![CDATA[utility computing]]></category>
		<category><![CDATA[Virtual Hosting]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[Web Architecture]]></category>
		<category><![CDATA[Web Services]]></category>

		<guid isPermaLink="false">https://kraftt.wordpress.com/?p=574</guid>
		<description><![CDATA[This is an review of how cloud computing is defined today and what the pundits have to say about its future. Currently it is confined to IT and the ability to move enterprise functions into the cloud onto virtual servers that can be allocated to meet demand. But what is the effect beyond IT? <a href="http://mindbeforeyoumine.com/2010/07/19/cloud-computing-from-both-sides-now/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=574&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2><strong>It&#8217;s Effects Beyond IT</strong></h2>
<p><a href="http://kraftt.files.wordpress.com/2010/07/img_0320.png"><img class="alignright size-full wp-image-637" title="Grid and Slice Cloud" src="http://kraftt.files.wordpress.com/2010/07/img_0320.png?w=640" alt=""   /></a>Cloud computing is a concept best understood within the context of IT<a href="#_edn1">[i]</a>. Its primary effect on business is the potential significant saving in capital expenditures (CAPEX) and potentially maintenance (OPEX)<a href="#_edn2">[ii]</a> of IT’s hosting assets. This is due to leveraging the scale of large server farms, and the recent ability to allocate slices of CPU, memory and storage on demand within these farms<a href="#_edn3">[iii]</a>.</p>
<p>This is sometimes referred to as <em>elastic computing</em> or <em>utility computing</em>. <em>Elastic</em> in the sense that additional resources can be allocated to meet periods of peak demand<a href="#_edn4">[iv]</a> and <em>utility</em> in that one pays only for resources used<a href="#_edn5">[v]</a>. So this would be a boon for startups and for new ventures within established companies to launch new products and services with significantly reduced startup expenditures and time to market<a href="#_edn6">[vi]</a>.<span id="more-574"></span></p>
<h3>Scalability and Availability At the Core</h3>
<p>For large Internet companies with stable traffic the benefits quickly diminish as the enterprise can build at a greater long term cost savings what is referred to as <em>internal clouds, </em>which in similar fashion integrate all resources as a single maintained entity through virtualization<a href="#_edn7">[vii]</a>.  In this sense cloud computing, whether external or internal, refers to computing and infrastructure resources that are configured for high <em>scalability</em> and <em>availability</em>.</p>
<p><em>Scalability</em> has already been discussed in a previous post<a href="#_edn8">[viii]</a> and is a significant aspect of the second surge in cloud computing or Web 2.0.  <em>Availability</em> is a related concept dealing with strategies for completing processes when computers and communications fail. Typically the strategies require duplicating processes on multiple computers in parallel and deciding at the completion of each process the correct result from these multiple sources<a href="#_edn9">[ix]</a>. Duplicating processes is a form of concurrent programming and follows the same principles as scalability, including Sullivan’s Law<a href="#_edn10">[x]</a> &#8211; it’s the high availability and not efficiency that is the critical factor<a href="#_edn11">[xi]</a>.</p>
<h3>An Enterprise Platform with Broad Application</h3>
<p>Cloud computing as defined above is neither a strikingly new business model nor novel technology breakthrough. After all, free or low cost hosting has been supported from the very beginning. What has changed is the experience of large Internet companies such as Google, Amazon, EBay, and Yahoo and more recently Facebook and Twitter in dealing with the scalability of servicing high consumer demand and streamlining the hardware / software platform cost<a href="#_edn12">[xii]</a>. This has resulted in better leveraging of engineering and IT efforts such that the resulting platforms and infrastructure are more abstract and flexible to support a wider array product service demands<a href="#_edn13">[xiii]</a>.</p>
<p>This in turn enables and moves forward various development and support strategies, such as Web Services (WS), Software as a Service (SaaS), Platform as a Service (PaaS), Infrastructure as a Service (IaaS) and Managed Services, which are sectors that have been around for sometime but have yet to reach mainstream acceptance<a href="#_edn14">[xiv]</a>. The expectation is that these will become more widely accepted and adopted successfully by more business models going forward<a href="#_edn15">[xv]</a>.</p>
<h3>The Dark Side of the Cloud</h3>
<p>As close to utopia as this would seem, cloud computing presents significant issues that must be addressed to successfully deploy applications to the cloud. The first and most significant is that cloud computing as discussed in most articles has little to do with computing and more to do with virtual hosting &#8211; getting resources into the cloud. Hosting issues that are relatively easy to address in self-hosted scenarios become an order of magnitude greater in virtual hosting environments. These include control, security, reliability, and performance monitoring<a href="#_edn16">[xvi]</a>.  Of course, new visualization technologies and standards have been developed to address these but require new IT procedures and tools that can be incompatible with standard operating procedures in a self-hosting scenario<a href="#_edn17">[xvii]</a>.  So the full implication of the change has to be considered along with business value proposition.</p>
<h2>From Virtual Hosting to Cloud Computing</h2>
<p>As important and fascinating as virtual hosting is to the evolution of the Internet, it is not by itself sufficient to ensure success or propel a Web 3.0 gestalt. Even with virtual resources with potential unlimited scalability and promised three sigma availability, ones actual results may differ. For success in realizing this potential and promise, the primary effort comes down to web architecture<a href="#_edn18">[xviii]</a>. This means actually designing the service to be elastic and highly concurrent.  We will find in this series of posts that this will require a new computing paradigm that is actually well understood and practiced by everyone except computer scientist.</p>
<p>To understand the implications of this paradigm, we will need to enlarge our scope of cloud computing beyond utility hosting to encompass how web site implementations and actual computing will change<a href="#_edn19">[xix]</a>.  This is what we will address in the next post and will begin with Google, who is the <em>defacto</em> architect of the current Cloud<a href="#_edn20">[xx]</a> and appears to be moving in different direction from conventional approaches being implemented by Amazon, Yahoo, and Microsoft<a href="#_edn21">[xxi]</a>. How can writing apps in Python be the basis of world domination? If we have learned anything from our history (as irreverent as it has been)<a href="#_edn22">[xxii]</a>, this is a clue we should probably not ignore.</p>
<h2>Works Cited</h2>
<p>Banerjee, U. (2009, 26-November). <em>Cloud Computing Service: Amazon EC2 vs  Google GAE</em>. From Cloud Computing Journal:  http://cloudcomputing.sys-con.com/node/1200706</p>
<p>Burrows, M. (2006, 7-November). <em>The Chubby Lock Service for Loosely-Coupled Distributed Systems</em>. From Google Research Publications: http://labs.google.com/papers/chubby.html</p>
<p>DataPlex. (2010, 7-January). <em>Cloud Computing Issues</em>. From Dataplex.com:  http://www.dataplex.com/blog/index.php/2010/01/07/cloud-computing-issues/</p>
<p>Dedicated Server School. (2009, 16-August). <em>Business Benefits of Cloud Computing</em>.  From Dedicated Server School:  http://www.serverschool.com/cloud-computing/business-benefits-of-cloud-computing/</p>
<p>Dugal, I. S., Alexander, S., &amp; Hellman, J. (2010, 15-July).  <em>Software as a Service</em>. From  WikiInvest: http://www.wikinvest.com/concept/Software_as_a_Service</p>
<p>Fogarty, K. (2010, 14-July). <em>Cloud Computing: Today&#8217;s Four Favorite Flavors, Explained</em>. From  ITWorld:  http://www.itworld.com/saas/113387/cloud-computing-todays-four-favorite-flavors-explained</p>
<p>Gedda, R. (2010, 24-February). <em>Open source helps Facebook achieve massive app scalability</em>. From  CIO:  http://www.cio.com.au/article/337284/open_source_helps_facebook_achieve_massive_app_scalability/</p>
<p>Grehan, R. (2008, 8-August). <em>Inside Amazon Web Services</em>. From Info World: Cloud Computing:  http://www.infoworld.com/d/cloud-computing/inside-amazon-web-services-421</p>
<p>High Scalability. (2008, 3-September). <em>Some Facebook Secrets to Better Operations</em>. From High Scalability:  http://highscalability.com/some-facebook-secrets-better-operations</p>
<p>Houghton, J. (2010, 25-July). <em>Back to the Future: Monitoring the Cloud</em>. From Cloud Computing  Journal: http://cloudcomputing.sys-con.com/node/1475427</p>
<p>Houghton, J. (2010, 16-June). <em>Cloud Computing Journal</em>. From Business Value and Cloud Computing:  http://cloudcomputing.sys-con.com/node/1433147</p>
<p>Houghton, J. (2010, 7-July). <em>Virtualization Journal</em>. From Virtualization Does Not a Cloud Make:  http://virtualization.sys-con.com/node/1455031</p>
<p>Knorr, E., &amp; Gruman, G. (2009, 5-May). <em>What cloud computing really means</em>. From  Info World, Cloud Computing:  http://www.infoworld.com/d/cloud-computing/what-cloud-computing-really-means-031</p>
<p>Kraft, T. (2010, 02-July). <em>Clouds of Hadoop: How Map Reduce Changed the World </em>. From Mind  Before You Mine: http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/</p>
<p>Kraft, T. (2010, 25-June). <em>Send In the Clouds: Old Metaphor Gets New Life</em>. From Mind Before  You Mine: http://mindbeforeyoumine.com/2010/06/25/send-in-the-clouds/</p>
<p>Marshall, D. (2008, 02-August). <em>GoGrid wants to be the new face for cloud computing</em>. From  InfoWorld Visualization:  http://www.infoworld.com/d/virtualization/gogrid-wants-be-new-face-cloud-computing-096?r=677&amp;r=572</p>
<p>Patton, P. C., &amp; Hersh, C. I. (2003, April). <em>Asynchronous Non-Preemptive Tasking for  Parallel and Distributed Systems</em>. From A Quantitative Methods/Computer  Science White Paper: http://140.209.112.90/WhitePapers/QMCS/ANT.html</p>
<p>Ricadela, A. (2007, 16-November). <em>Computing Heads for the Clouds</em>. From Bloomberg Businessweek:  http://www.businessweek.com/technology/content/nov2007/tc20071116_379585.htm</p>
<p>Rothschild, J. (2009, 13-October). <em>Why are Facebook, Digg, and Twitter so hard to scale?</em> From High  Scalability:  http://highscalability.com/blog/2009/10/13/why-are-facebook-digg-and-twitter-so-hard-to-scale.html</p>
<p>Schonfeld, E. (2010, 7-September). <em>As Digg Struggles, VP Of Engineering Is Shown The Door</em>. From  TechCrunch:  http://techcrunch.com/2010/09/07/digg-struggles-vp-engineering-door/</p>
<p>Staten, J. (2010, 02-August). <em>To Get Cloud Economics Right, Think Small, Very, Very Small</em>. From  James Staten&#8217;s Blog: http://blogs.forrester.com/james_staten/10-08-02-get_cloud_economics_right_think_small_very_very_small</p>
<div>
<hr size="1" />
<div>
<p><a href="#_ednref">[i]</a> <a href="http://www.infoworld.com/d/cloud-computing/what-cloud-computing-really-means-031">(Knorr &amp; Gruman, 2009)</a></p>
</div>
<div>
<p><a href="#_ednref">[ii]</a> <a href="http://cloudcomputing.sys-con.com/node/1433147">(Houghton, Cloud Computing Journal, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[iii]</a> <a href="http://www.serverschool.com/cloud-computing/business-benefits-of-cloud-computing/">(Dedicated Server School, 2009)</a></p>
</div>
<div>
<p><a href="#_ednref">[iv]</a> <a href="http://www.infoworld.com/d/cloud-computing/inside-amazon-web-services-421">(Grehan, 2008)</a></p>
</div>
<div>
<p><a href="#_ednref">[v]</a> <a href="http://www.infoworld.com/d/virtualization/gogrid-wants-be-new-face-cloud-computing-096?r=677&amp;r=572">(Marshall, 2008)</a></p>
</div>
<div>
<p><a href="#_ednref">[vi]</a> <a href="http://blogs.forrester.com/james_staten/10-08-02-get_cloud_economics_right_think_small_very_very_small">(Staten, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[vii]</a> Virtualization is the process of visualizing a diverse array of physical resources as one virtual entity with respect to monitoring and management control or conversely sharing the same set of resources among multiple virtual entities with respect to operating system and server environment. <a href="http://virtualization.sys-con.com/node/1455031">(Houghton, Virtualization Journal, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[viii]</a> <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">(Kraft T. , 2010)</a> see section: Brief Parable on Scale</p>
</div>
<div>
<p><a href="#_ednref">[ix]</a> The most prevalent strategy that is the basis of all Internet protocols uses Praxos algorithms. See <a href="http://en.wikipedia.org/wiki/Paxos_algorithm">(Wikipedia)</a> also for its application in cloud computing <a href="http://labs.google.com/papers/chubby.html">(Burrows, 2006)</a></p>
</div>
<div>
<p><a href="#_ednref">[x]</a> Sullivan’s Law comes from a private conversation with the author that summarized the gestalt between scalability and efficiency summarized in the Brief Parable on Scale <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">(Kraft T. , 2010)</a> but a summary paper on Dr. Sullivan’s contributions can be found here, <a href="http://140.209.112.90/WhitePapers/QMCS/ANT.html">(Patton &amp; Hersh, 2003)</a></p>
</div>
<div>
<p><a href="#_ednref">[xi]</a> Just remember, Scalability and Availability form the parallel plane that is perpendicular to Efficiency. If the last sentence makes sense then you maybe on your way to thinking concurrently.  If not don’t worry, come back after reading this series and see if makes more sense.</p>
</div>
<div>
<p><a href="#_ednref">[xii]</a> Scalability is not guaranteed. For example, Diggs is currently struggling leading the VP of Engineering recently being shown the door <a href="http://techcrunch.com/2010/09/07/digg-struggles-vp-engineering-door/">(Schonfeld, 2010)</a>.  A description of the difficulty can be found here <a href="http://highscalability.com/blog/2009/10/13/why-are-facebook-digg-and-twitter-so-hard-to-scale.html">(Rothschild, 2009)</a>. A list of how Facebook addresses scalability <a href="http://highscalability.com/some-facebook-secrets-better-operations">(High Scalability, 2008)</a> as well as how it uses Open Source <a href="http://www.cio.com.au/article/337284/open_source_helps_facebook_achieve_massive_app_scalability/">(Gedda, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[xiii]</a> <a href="http://www.itworld.com/saas/113387/cloud-computing-todays-four-favorite-flavors-explained">(Fogarty, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[xiv]</a> <a href="http://www.infoworld.com/d/cloud-computing/what-cloud-computing-really-means-031">(Knorr &amp; Gruman, 2009)</a></p>
</div>
<div>
<p><a href="#_ednref">[xv]</a> <a href="http://www.wikinvest.com/concept/Software_as_a_Service">(Dugal, Alexander, &amp; Hellman, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[xvi]</a> <a href="http://www.dataplex.com/blog/index.php/2010/01/07/cloud-computing-issues/">(DataPlex, 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[xvii]</a> <a href="http://cloudcomputing.sys-con.com/node/1475427">(Houghton, Back to the Future: Monitoring the Cloud,  2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[xviii]</a> This was the general theme of the review of current cloud offerings by <a href="http://www.infoworld.com/d/cloud-computing/what-cloud-computing-really-means-031">(Knorr &amp; Gruman, 2009)</a></p>
</div>
<div>
<p><a href="#_ednref">[xix]</a> <a href="http://www.businessweek.com/technology/content/nov2007/tc20071116_379585.htm">(Ricadela, 2007)</a></p>
</div>
<div>
<p><a href="#_ednref">[xx]</a> <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">(Kraft T. , 2010)</a></p>
</div>
<div>
<p><a href="#_ednref">[xxi]</a> <a href="http://cloudcomputing.sys-con.com/node/1200706">(Banerjee, 2009)</a></p>
</div>
<div>
<p><a href="#_ednref">[xxii]</a> Irreverent History of Cloud 1.0 <a href="http://mindbeforeyoumine.com/2010/06/25/send-in-the-clouds/">(Kraft T. , 2010) </a>and Cloud 2.0 <a href="http://mindbeforeyoumine.com/2010/07/02/clouds-of-hadoop/">(Kraft T. , 2010)</a></p>
</div>
</div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/574/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/574/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/574/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/574/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/574/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/574/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/574/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/574/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/574/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/574/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/574/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/574/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/574/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/574/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=574&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/07/19/cloud-computing-from-both-sides-now/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" />
		<media:content url="http://kraftt.files.wordpress.com/2010/07/img_0219.jpg?w=112" medium="image">
			<media:title type="html">HAL Cloud</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/07/img_0320.png" medium="image">
			<media:title type="html">Grid and Slice Cloud</media:title>
		</media:content>
	</item>
		<item>
		<title>McLuhan&#8217;s Tetrad Applied to Internet</title>
		<link>http://mindbeforeyoumine.com/2010/07/12/mcluhans-tetrad-applied-to-internet/</link>
		<comments>http://mindbeforeyoumine.com/2010/07/12/mcluhans-tetrad-applied-to-internet/#comments</comments>
		<pubDate>Mon, 12 Jul 2010 19:28:58 +0000</pubDate>
		<dc:creator>Timothy Kraft</dc:creator>
				<category><![CDATA[Asides]]></category>
		<category><![CDATA[Blogosphere]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Marshall McLuhan]]></category>
		<category><![CDATA[measurement]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[online / offline behavior]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[VoC]]></category>

		<guid isPermaLink="false">https://kraftt.wordpress.com/?p=563</guid>
		<description><![CDATA[An obvious and much anticipated characteristic of the internet is that it "makes the world much smaller". One might expect in the free exchange of ideas and knowledge that the world as whole would become more unified. But as we look at society, it is anything but unified. Can the Internet as a medium be responsible for this? <a href="http://mindbeforeyoumine.com/2010/07/12/mcluhans-tetrad-applied-to-internet/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=563&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I have been looking at how to apply Marshall McLuhan&#8217;s <a href="http://www.horton.ednet.ns.ca/staff/scottbennett/media/">Laws of Media</a> to the Web for my next post on cloud computing. For a &#8220;hard&#8221; scientist, McLuhan is as close as we will get to Newton in Sociology for understanding how technology effects society. His Laws of Media are as rigorous as Newton&#8217;s and his equation &#8211; &#8220;<a href="http://individual.utoronto.ca/markfederman/article_mediumisthemessage.htm">the medium is the message</a>&#8221; &#8211; as famous as Einstein&#8217;s &#8220;E equals m c squared&#8221; if not as equally obtuse.</p>
<p>One can be easily lulled into believing it is <a href="http://www.onthemedia.org/2004/apr/16/mcluhan-on-mcluhan/">simple and straight forward</a>. Though McLuhan&#8217;s presentation <a href="http://www.marshallmcluhan.com/poster.html">may seem flippant</a>, the sociology in the background is serious and complex. Indeed it is the background or context of technology that at first seems unmoved and unchanging that does eventually change in ways that might be missed because the changes appear unrelated. But it is in understanding both the obvious implications of a technology and the not so obvious effects on culture and society that transforms technology into media in McLuhan&#8217;s view.</p>
<p>For example, an obvious and much anticipated characteristic of the internet is that it <a href="http://wiki.answers.com/Q/Why_is_it_technology_makes_the_world_smaller">&#8220;makes the world much smaller&#8221;</a>. One might expect in the free exchange of ideas and knowledge that the world as whole would become more unified. But as we look at society, it is anything but unified. Can the Internet as a medium be responsible for this?<br />
<span id="more-563"></span></p>
<p><a href="http://kraftt.files.wordpress.com/2010/07/p_834_734_cb6eca55-3052-406c-b560-865c46ff8c1d.jpeg"><img class="alignright size-medium wp-image-558" title="p_834_734_CB6ECA55-3052-406C-B560-865C46FF8C1D.jpeg" src="http://kraftt.files.wordpress.com/2010/07/p_834_734_cb6eca55-3052-406c-b560-865c46ff8c1d.jpeg?w=264&#038;h=300" alt="" width="264" height="300" /></a>As I look at the problem from the 4 Laws of Media, the results are startling and not what I initially expected. To review briefly: the four laws represent four aspects of media that must always be present. To apply, one must not only consider what is manifest and technical but also latent and behavioral in the background.</p>
<p>The result is a tetrad consisting of four co-equal aspects of the mediums effect on human behavior:</p>
<table>
<tbody>
<tr>
<td>Enhance</td>
<td>The technology must enhance some capability of the person. The medium is an extension of the person.</td>
</tr>
<tr>
<td>Retrieve</td>
<td>The result is a retrieval of some earlier service or capability that was lost and is now brought back into play in a new form.</td>
</tr>
<tr>
<td>Obsolesced</td>
<td>What is pushed aside and made obsolescent.</td>
</tr>
<tr>
<td>Reverse</td>
<td>If the new media is taken to extreme what will result that reverses the original characteristics of the media when it was first introduced.</td>
</tr>
</tbody>
</table>
<p>So now apply these rules to a medium such as the Internet (before the Web). The Internet (as invented by DARPA and Al Gore) enhances thought both in retrieving information that nurtures thought and expressing thoughts to help others. It obsolesced books or publications that inform thought but at a much slower rate and at higher cost preventing just anyone off the street from participating (except as a reader). The Internet retrieves the Forum in Athens where individuals came to share thoughts and dialogue.</p>
<p>Normally one would think of the Internet as a medium of communication with dialog as a form of communication. Cell telephones are a form of communication medium that enhances the voice, the Internet goes beyond voice and records &amp; recalls what has been &#8220;voiced&#8221;. It is therefore an extension of the minds creative abilities to recall, extend and amplify thought.</p>
<p>This is important when considering the original characteristics and intentions and how the medium when taken to extremes reverses these characteristics. The expectation was that the Internet would bring democracy or equality to thought allowing persons anywhere in the world to participate in the public forum. The hope was for unity of thought that transcended specific cultures or traditions &#8211; world-wide <a href="http://www.culturalization.com/">culturalization</a>. A scientist in Botswana would be on equal footing to one at MIT or Oxford.</p>
<p><a href="http://kraftt.files.wordpress.com/2010/07/p_834_734_1d840083-0a32-476c-99a9-be0bb3c244e9.jpeg"><img class="alignleft size-medium wp-image-559" title="p_834_734_1D840083-0A32-476C-99A9-BE0BB3C244E9.jpeg" src="http://kraftt.files.wordpress.com/2010/07/p_834_734_1d840083-0a32-476c-99a9-be0bb3c244e9.jpeg?w=264&#038;h=300" alt="" width="264" height="300" /></a>The expected unity has not materialized. That is because the reverse of thought is thoughtlessness or those things that imprison the mind. Instead of amplifying just good and reasonable thoughts, the Internet allows marginal thoughts that would be normally isolated and pushed to the &#8220;fringe&#8221; of society to likewise find and bring together the few supporters to amplify dogma, bigotry, lies, superstition, and racism just as quickly. In a world of false equivalency, it seems only fair that every thought whether based on truth or lies to be treated equally.</p>
<p>Instead of unity, we have a fragmented American as well as World culture where individuals can believe and operate in their own &#8220;realities&#8221; with their own &#8220;truths&#8221; and find others that believe the same. So the fact that in this country no &#8220;reality&#8221; can get more than 20% to 30% confirmation in polling, is this because of the Internet? Your thoughts please.</p>
<p>To check this further, I have applied the Tetrads to Facebook, Twitter, and the Blogosphere. Amazingly they are very different media though they seem to be doing similar things communication wise. What are your views?</p>
<table>
<tbody>
<tr>
<td><a href="http://kraftt.files.wordpress.com/2010/07/p_834_734_8c1f012a-9b58-4357-a4f7-f7ea02c058fc.jpeg"><img class="alignleft size-thumbnail wp-image-560" title="p_834_734_8C1F012A-9B58-4357-A4F7-F7EA02C058FC.jpeg" src="http://kraftt.files.wordpress.com/2010/07/p_834_734_8c1f012a-9b58-4357-a4f7-f7ea02c058fc.jpeg?w=132&#038;h=150" alt="" width="132" height="150" /></a></td>
<td><a href="http://kraftt.files.wordpress.com/2010/07/p_834_734_b4b51334-24bb-4c66-af1a-6a82c1133cd3.jpeg"><br />
<img class="alignnone size-thumbnail wp-image-561" title="p_834_734_B4B51334-24BB-4C66-AF1A-6A82C1133CD3.jpeg" src="http://kraftt.files.wordpress.com/2010/07/p_834_734_b4b51334-24bb-4c66-af1a-6a82c1133cd3.jpeg?w=132&#038;h=150" alt="" width="132" height="150" /></a></td>
<td><a href="http://kraftt.files.wordpress.com/2010/07/p_834_734_3fcd3e5b-8366-42b5-ada8-a4979db23240.jpeg"><img class="alignnone size-thumbnail wp-image-562" title="p_834_734_3FCD3E5B-8366-42B5-ADA8-A4979DB23240.jpeg" src="http://kraftt.files.wordpress.com/2010/07/p_834_734_3fcd3e5b-8366-42b5-ada8-a4979db23240.jpeg?w=132&#038;h=150" alt="" width="132" height="150" /></a></td>
</tr>
</tbody>
</table>
<p>Not to belabor the obvious, yes this is important for anyone attempting to measure &#8220;behavior&#8221; on the Internet because we are attempting grasp the effect (sociology) of technology on behavior. So get out your crayons &amp; paper and start drawn up some Tetrads!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/kraftt.wordpress.com/563/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/kraftt.wordpress.com/563/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/kraftt.wordpress.com/563/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/kraftt.wordpress.com/563/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/kraftt.wordpress.com/563/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/kraftt.wordpress.com/563/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/kraftt.wordpress.com/563/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/kraftt.wordpress.com/563/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/kraftt.wordpress.com/563/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/kraftt.wordpress.com/563/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/kraftt.wordpress.com/563/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/kraftt.wordpress.com/563/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/kraftt.wordpress.com/563/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/kraftt.wordpress.com/563/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mindbeforeyoumine.com&amp;blog=6433374&amp;post=563&amp;subd=kraftt&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mindbeforeyoumine.com/2010/07/12/mcluhans-tetrad-applied-to-internet/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/7838f6345b0755d272a38ccb666d58ce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kraftt</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/07/p_834_734_cb6eca55-3052-406c-b560-865c46ff8c1d.jpeg?w=264" medium="image">
			<media:title type="html">p_834_734_CB6ECA55-3052-406C-B560-865C46FF8C1D.jpeg</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/07/p_834_734_1d840083-0a32-476c-99a9-be0bb3c244e9.jpeg?w=264" medium="image">
			<media:title type="html">p_834_734_1D840083-0A32-476C-99A9-BE0BB3C244E9.jpeg</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/07/p_834_734_8c1f012a-9b58-4357-a4f7-f7ea02c058fc.jpeg?w=132" medium="image">
			<media:title type="html">p_834_734_8C1F012A-9B58-4357-A4F7-F7EA02C058FC.jpeg</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/07/p_834_734_b4b51334-24bb-4c66-af1a-6a82c1133cd3.jpeg?w=132" medium="image">
			<media:title type="html">p_834_734_B4B51334-24BB-4C66-AF1A-6A82C1133CD3.jpeg</media:title>
		</media:content>

		<media:content url="http://kraftt.files.wordpress.com/2010/07/p_834_734_3fcd3e5b-8366-42b5-ada8-a4979db23240.jpeg?w=132" medium="image">
			<media:title type="html">p_834_734_3FCD3E5B-8366-42B5-ADA8-A4979DB23240.jpeg</media:title>
		</media:content>
	</item>
	</channel>
</rss>
