<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.0.3" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>Applied Insights</title>
	<link>http://www.applied-insights.co.uk</link>
	<description>Creating customer insight through data</description>
	<pubDate>Fri, 20 Jun 2008 15:03:16 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.3</generator>
	<language>en</language>
			<item>
		<title>Announcing the launch of our YouTube channel</title>
		<link>http://www.applied-insights.co.uk/news/2008/06/17/announcing-the-launch-of-our-youtube-channel/</link>
		<comments>http://www.applied-insights.co.uk/news/2008/06/17/announcing-the-launch-of-our-youtube-channel/#comments</comments>
		<pubDate>Tue, 17 Jun 2008 21:19:09 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2008/06/17/announcing-the-launch-of-our-youtube-channel/</guid>
		<description><![CDATA[Neil and John have launched themselves into the 21st century and have set up a YouTube channel. You can find it hereâ€¦

www.youtube.com/appliedinsights
	
So far we have some video blogs up there from the recent Emetrics conferences in San Francisco and London. Over time we will be adding to that with discussions and video seminars on various [...]]]></description>
			<content:encoded><![CDATA[<p>Neil and John have launched themselves into the 21<sup>st</sup> century and have set up a YouTube channel. You can find it hereâ€¦
</p>
<p><a href="http://www.youtube.com/appliedinsights">www.youtube.com/appliedinsights</a>
	</p>
<p>So far we have some video blogs up there from the recent Emetrics conferences in San Francisco and London. Over time we will be adding to that with discussions and video seminars on various topics to do with digital marketing and predictive analytics, or even digital marketing predictive analytics!
</p>
<p>Here&#8217;s a sample which is Neil catching up with Avinash Kaushik towards the end of Emetrics San Francisco 2008.
</p>
<p><object type="application/x-shockwave-flash" style="width:425px; height:355px;" data="http://www.youtube.com/v/KMHsz3alHjs&amp;rel=0&amp;color1=0x006699&amp;color2=0x54abd6&amp;border=1"><param name="movie" value="http://www.youtube.com/v/KMHsz3alHjs&amp;rel=0&amp;color1=0x006699&amp;color2=0x54abd6&amp;border=1" /></object>
</p>
<p>
Â </p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2008/06/17/announcing-the-launch-of-our-youtube-channel/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Report from the Frontline: Emetrics San Francisco</title>
		<link>http://www.applied-insights.co.uk/news/2008/05/14/report-from-the-frontline-emetrics-san-francisco/</link>
		<comments>http://www.applied-insights.co.uk/news/2008/05/14/report-from-the-frontline-emetrics-san-francisco/#comments</comments>
		<pubDate>Wed, 14 May 2008 20:04:38 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2008/05/14/report-from-the-frontline-emetrics-san-francisco/</guid>
		<description><![CDATA[Last week I went to the Emetrics Marketing Optimisation Summit in San Francisco. You can read about my impressions of the conference over at my column at ClickZ and also watch a series of video blogs over at the Applied Insights channel on YouTube.
]]></description>
			<content:encoded><![CDATA[<p>Last week I went to the Emetrics Marketing Optimisation Summit in San Francisco. You can read about my impressions of the conference over at my column at <a href="http://www.clickz.com/showPage.html?page=3629459">ClickZ</a> and also watch a series of video blogs over at the <a href="http://www.youtube.com/user/AppliedInsights">Applied Insights channel on YouTube</a>.</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2008/05/14/report-from-the-frontline-emetrics-san-francisco/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Does Europe need its own Web Analytics Association?</title>
		<link>http://www.applied-insights.co.uk/news/2008/05/02/does-europe-need-its-own-web-analytics-association/</link>
		<comments>http://www.applied-insights.co.uk/news/2008/05/02/does-europe-need-its-own-web-analytics-association/#comments</comments>
		<pubDate>Fri, 02 May 2008 11:01:38 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
	<category>Europe</category>
	<category>WAA</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2008/05/02/does-europe-need-its-own-web-analytics-association/</guid>
		<description><![CDATA[The answer is probably yes. There â€“ that was easy enough.
But then it gets a bit trickier, the questions start piling up. Like:

What would a European WAA look like?
How would it be organised?
How would it work (or not) along side the existing WAA?
What kind of legal status would it take?
How would it be funded?

And probably [...]]]></description>
			<content:encoded><![CDATA[<p>The answer is probably yes. There â€“ that was easy enough.</p>
<p>But then it gets a bit trickier, the questions start piling up. Like:</p>
<ul>
<li>What would a European WAA look like?</li>
<li>How would it be organised?</li>
<li>How would it work (or not) along side the existing WAA?</li>
<li>What kind of legal status would it take?</li>
<li>How would it be funded?</li>
</ul>
<p>And probably a whole lot more that I haven&#8217;t thought of yet.</p>
<p>I have to declare an interest here. As a Board member on the Web Analytics Association, my responsibility is for &#8220;International&#8221;. I think last year was the first year that a Director on the Board had responsibility for &#8220;International&#8221;, though there has always been an International Committee ably co-chaired by Vicky Brock and Steve Jackson. Vicky, Steve and others have done a great job over the years helping to get activity happening at the local level in markets around the world. When I came onto the Board a year ago I agreed with Steve and Vicky that our priorities should be to continue to expand our international reach and also to look for ways to deliver more value to our international members.</p>
<p>A year on, have we done as much as we would have liked? Probably not. We are all volunteers, doing this is our own time. Most of us run our own businesses but I&#8217;m amazed at the amount of time that people do put in around the world on a volunteer basis.</p>
<p>Have we made any progress? Yes, we have. We have a new structure in place on the International Committee that should allow us to expand without losing focus and coordination. We have expanded into new markets by appointing country managers in places such as France, Spain, Russia, Argentina and Brazil. We are looking at how we can expand our activities into Asia. We are working on the <em>structures</em> and <em>processes</em> which will enable us to better help volunteer activity on the ground.</p>
<p>Could we be doing more? Absolutely. And it is a real case of &#8220;many hands making light work&#8221;. We need people to step up to the plate and get involved. I know it can be frustrating that sometimes we don&#8217;t seem to react in real time but as I said before that as a volunteer organisation it can take time to have the meetings, make the calls, to come to the decisions.</p>
<p>So back to the questionâ€¦ Does Europe need its own WAA? The answer is still &#8220;probably&#8221; but, the reality is that at this moment in time I don&#8217;t know. There is no doubt that International representation is getting stronger within the WAA. In 2006 there were no European Directors on the Board, in 2007 there were two. In 2008 hopefully more! As someone who spent many years working in the European divisions of US companies, I am well aware of the frustrations that can cause! I do think though that the WAA is becoming more internationally orientated and this debate about a European WAA is a great one to have. For me the next step is to work out how we get to the point of decision. There&#8217;s a lot of work to be done finding out what&#8217;s the best thing to do and how best to do it. We&#8217;ll be kicking that process off in San Francisco next week. After that I am sure we will be looking for all the help we can get! If you&#8217;re interested in helping out with the International activities and development of the WAA, let me know.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2008/05/02/does-europe-need-its-own-web-analytics-association/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Back to blogging?</title>
		<link>http://www.applied-insights.co.uk/news/2008/04/30/back-to-blogging/</link>
		<comments>http://www.applied-insights.co.uk/news/2008/04/30/back-to-blogging/#comments</comments>
		<pubDate>Wed, 30 Apr 2008 17:53:38 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2008/04/30/back-to-blogging/</guid>
		<description><![CDATA[It&#8217;s been a year since we last blogged on the site and we have had the odd comment about it. At the back end of last year Avinash mentioned in his blog review of my presentation at Emetrics in Washington DC that we should have more content in our blog. As I explained to Avinash [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s been a year since we last blogged on the site and we have had the odd comment about it. At the back end of last year Avinash mentioned in his <a href="http://www.kaushik.net/avinash/2007/12/emetrics-dc-07-reflections-accuracy-precision-predictive-analytics.html">blog review</a> of my presentation at <a href="http://www.applied-insights.co.uk/news/2007/10/23/emetrics-marketing-optimization-summit-washington-dc-october-2007/">Emetrics in Washington DC</a> that we should have more content in our blog. As I explained to Avinash one of the challenges with a <a href="http://www.clickz.com/showPage.html?page=3622884">regular column at ClickZ</a> that it&#8217;s not always easy to come up with sufficiently new and interesting stuff to talk about. And in any case there&#8217;s no way to compete with Avinash&#8217;s prolific output!
</p>
<p>The web analytics space has many great bloggers, covering many different areas. Some are great on the technical side of explaining how to &#8220;do&#8221; web analytics, some cover industry developments and so on. I think it&#8217;s important to find some way to contribute to the debates without just adding to the noise. So, we&#8217;re going to give it another shot. John&#8217;s going to look at developments in the world of predictive analytics and data mining and I&#8217;m going to muse about digital marketing analytics and what&#8217;s happening here in Europe. We&#8217;ll be trying a few things out to see if they work, some will, some won&#8217;t, but that&#8217;s the name of the game. Stay tuned.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2008/04/30/back-to-blogging/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Want to find out more about Predictive Analytics?</title>
		<link>http://www.applied-insights.co.uk/news/2008/04/30/want-to-find-out-more-about-predictive-analytics/</link>
		<comments>http://www.applied-insights.co.uk/news/2008/04/30/want-to-find-out-more-about-predictive-analytics/#comments</comments>
		<pubDate>Wed, 30 Apr 2008 11:00:13 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2008/04/30/want-to-find-out-more-about-predictive-analytics/</guid>
		<description><![CDATA[If so, then Neil and John are running a one day workshop in Predictive Analytics in association with the Emetrics Marketing Optimisation summit on 22nd May at the Hotel Russell in London.
For more details and to register, click here.

]]></description>
			<content:encoded><![CDATA[<p>If so, then Neil and John are running a one day workshop in Predictive Analytics in association with the Emetrics Marketing Optimisation summit on 22<sup>nd</sup> May at the Hotel Russell in London.</p>
<p>For more details and to register, <a href="http://www.emetrics.org/2008/london/predictiveanalytics.php">click here</a>.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2008/04/30/want-to-find-out-more-about-predictive-analytics/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>I&#8217;ve always wanted to say this..&#8221;Vote for me!&#8221;</title>
		<link>http://www.applied-insights.co.uk/news/2007/04/04/ive-always-wanted-to-say-thisvote-for-me/</link>
		<comments>http://www.applied-insights.co.uk/news/2007/04/04/ive-always-wanted-to-say-thisvote-for-me/#comments</comments>
		<pubDate>Wed, 04 Apr 2007 16:53:54 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
	<category>Analytics strategy</category>
	<category>Web analytics</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2007/04/04/ive-always-wanted-to-say-thisvote-for-me/</guid>
		<description><![CDATA[Those of you that are members of the Web Analytics Association (WAA) will have had an email recently about the Election Ballot for the WAA Board of Directors. This year I have put forward my nomination to be on the Board of Directors and it would be great if you felt able to support me.
During [...]]]></description>
			<content:encoded><![CDATA[<p>Those of you that are members of the Web Analytics Association (<a title="WAA website" target="_blank" href="http://www.webanalyticsassociation.org/">WAA</a>) will have had an email recently about the Election Ballot for the WAA Board of Directors. This year I have put forward my nomination to be on the Board of Directors and it would be great if you felt able to support me.</p>
<p>During the nomination process we were asked to respond to some questions which might help people understand why we wanted to be on the Board. However, I can&#8217;t find this copy on the WAA  website anywhere (and I have looked), only <a title="WAA Nominations" target="_blank" href="http://www.webanalyticsassociation.org/en/cms/?757">the profile</a> that we were also asked to complete.</p>
<p>So I thought I would publish my responses to the questions here in case you wanted to understand my perspective on industry matters and motivation to be a part of the WAA team.</p>
<p>There are also many great people who are seeking election, some of whom I know and some of whom I don&#8217;t. Since you can vote for as many people as you want, I don&#8217;t feel too bad about this shameless canvassing! The greater the depth and breadth of experience and expertise the WAA has at its disposal the better I feel.</p>
<p>Anyway - here&#8217;s my &#8220;manifesto&#8221;.</p>
<p class="MsoNormal"><span lang="EN-GB"><strong>Which current committee do you have a passion about? Or is there a committee you would like to see us undertake?</strong></span></p>
<p class="MsoNormal"><span lang="EN-GB">Coming from the UK Iâ€™m naturally interested in the International aspects of the organisationâ€™s work. Iâ€™m also interested in raising the profile of the industry, so I guess that means Marketing!</span></p>
<p class="MsoNormal"><span lang="EN-GB"><strong>What is the greatest challenge you see the Web Analytics industry facing?</strong></span></p>
<p class="MsoNormal"><span lang="EN-GB">I sometimes think that the industry defines itself in too narrow a position. Iâ€™m a great believer that you need to have a holistic approach to thinking about the effectiveness of their investments in the online channel. The industry needs to continue to broaden and integrate with other online marketing technologies and data providers if it is not to become a commodity data provide to the online marketing industry.</span></p>
<p class="MsoNormal"><span lang="EN-GB"><strong>What major contribution will you bring to WAA and its membership?</strong></span></p>
<p class="MsoNormal"><span lang="EN-GB">I think I will bring a broader perspective to the WAA. I have worked in marketing analytics all of my career but in a number of different functions and environments. As a result I think I have a rounded perspective and approach to web analytics and what it means for organizations. I think this perspective will help the WAA to continue to move forward.</span></p>
<p class="MsoNormal"><span lang="EN-GB">I also think that I can help support and develop the WAA over here in Europe.</span></p>
<p class="MsoNormal"><strong><span lang="EN-GB">Why should people vote for you?</span></strong></p>
<p class="MsoNormal"><span lang="EN-GB">Because they want to?</span></p>
<p class="MsoNormal"><strong><span lang="EN-GB">Where do you think the organization should be in the next year?</span></strong></p>
<p class="MsoNormal"><span lang="EN-GB">As I mentioned earlier I think that the organization needs to recognise and embrace the wider perspective when it comes to measuring and optimizing the online channel. It should look to forge partnerships with other associated bodies and organizations in areas such as usability, marketing research and surveys, advertising research and so on.</span></p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2007/04/04/ive-always-wanted-to-say-thisvote-for-me/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>How to do Predictive Analytics - Part 5</title>
		<link>http://www.applied-insights.co.uk/news/2007/03/15/how-to-do-predictive-analytics-part-5/</link>
		<comments>http://www.applied-insights.co.uk/news/2007/03/15/how-to-do-predictive-analytics-part-5/#comments</comments>
		<pubDate>Thu, 15 Mar 2007 15:29:49 +0000</pubDate>
		<dc:creator>John McConnell</dc:creator>
		
	<category>Applied Insights Blog</category>
	<category>Analytics strategy</category>
	<category>Predictive analytics</category>
	<category>Data mining</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2007/03/15/how-to-do-predictive-analytics-part-5/</guid>
		<description><![CDATA[Steps 4 and 5 â€“ Modelling and Evaluation, The Theory
Now for the serious stuff (or the fun stuff depending on your inclination!). Of course the modelling phase is at the core of a predictive analytic effort. CRISP rightly separates modelling and evaluation into separate steps which emphasises the importance of the latter. However they are [...]]]></description>
			<content:encoded><![CDATA[<p><span lang="EN-GB"><strong><span lang="EN-GB">Steps 4 and 5 â€“ Modelling and Evaluation, The Theory<br />
</span></strong></span><span lang="EN-GB">Now for the serious stuff (or the fun stuff depending on your inclination!). Of course the modelling phase is at the core of a predictive analytic effort. <a title="CRISP Home" target="_blank" href="http://www.crisp-dm.org/">CRISP</a> rightly separates modelling and evaluation into separate steps which emphasises the importance of the latter. However they are intrinsically linked and we will consider them both together here.<br />
</span><span lang="EN-GB" /><span lang="EN-GB">As this is really the central issue Iâ€™ll break into 2 parts. Letâ€™s talk about the theory of how we go about it and the in the next blog entry Iâ€™ll try and â€œmake it realâ€? with a practical example.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB">Just to recap on how we got here. Starting with a research or business objective weâ€™ve garnered enough understanding to embark on a predictive exercise. Furthermore weâ€™ve explored the data and found predictive potential. More than likely weâ€™ve uncovered enough relationships in the data as we explored it to indicate that patterns exist which will allow us to predict the outcome(s) of interest.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB"><strong>So who can do this?<br />
</strong></span><span lang="EN-GB">Traditionally predictive modelling has been the domain of the expert. The statistician, mathematician, econometrician, the numerate researcher or the more expert â€œanalystâ€?, etc..  This is still largely the case today but we are seeing increasing signs of analytical democratisation.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB">Some of the contemporary tools discussed below do require less expertise to develop models today because of smarter user interfaces. There has been a  move to more automated algorithms like decision trees where the analyst does not need to know as much about the data structures or the requirements/assumptions of the algorithm to specify the <em>correct</em> analysis. More traditional statistical methods, like Regressions for example, do require the analyst to understand the technique well enough to specify the right settings/options and to follow certain rules about the data; e.g. that the input variables are not too highly correlated (i.e. â€œmulti-colinearâ€?). Methods and algorithms from the world of Artificial Intelligence e.g. Neural nets, and the trees are generally more tolerant of different data patterns and have fewer options for the analyst to worry about.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB">Nevertheless it is rarely the case that we can just â€œpress the buttonâ€? without a certain level of expertise in the analytical tool and/or the handling of data. But with a few days training most business and research users should be able to run models even in the most advanced tools. </span><span lang="EN-GB">More specifically developed Analytical Applications can often provide a higher level of accessibility to deeper analytical methods for broader, less expert, audience.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB"><strong>And how?&#8230;<br />
</strong></span><span lang="EN-GB">For heavy duty predictive modelling the analyst will typically have an arsenal of predictive tools and algorithms at his/her disposal. Weâ€™ll revisit the various tools/platforms later but the vendors who probably offer the most are <a title="SAS Home" target="_blank" href="http://www.sas.com/">SAS</a> and <a title="SPSS Home" target="_blank" href="http://www.spss.com/">SPSS</a>. Though there are some, relatively, new entrants making headway such as <a title="KXEN home" target="_blank" href="http://www.kxen.com/">KXEN</a>, <a title="Salford Systems Home" target="_blank" href="http://www.salford-systems.com/">Salford Systems</a> and <a title="Think Analytics home" target="_blank" href="http://www.thinkanalytics.com/">Think Analytics</a>. See the Gartner <a title="Gartner Quadrant" target="_blank" href="http://www.gartner.com/DisplayDocument?id=488171">Magic Quadrant for Customer Data Mining</a> for one view of the landscape of predictive software tools.<br />
</span><span lang="EN-GB"><br />
</span><span lang="EN-GB" /><span lang="EN-GB">In the last step we spent some time ensuring that the data was in the right shape for this step. Hence, in the simplest sense the modelling process itself is just about defining the input and output variable(s) of interest and building and evaluating multiple models.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB"><strong>Which method to choose?<br />
</strong></span><span lang="EN-GB">In part of course this will depend on what you have available. If you only have Excel then, without purchasing an add-on like <a title="XLMiner home" target="_blank" href="http://www.resample.com/xlminer/">XLMiner</a>, you have access to the models available in the Excel statistical pack. As I mentioned in earlier blogs if you are entering the predictive arena for the first time you may want to consider some of the freely available software, particularly <a title="R Project Home" target="_blank" href="http://www.r-project.org/">R</a>. The caveat to this is that, as I write, you need to be able to learn the R language to drive the models. I am not currently aware of any particular user interfaces that help accelerate the usage. Despite that initial technical hurdle R does offer a very impressive range of modelling algorithms. </span><span lang="EN-GB">Alternatively you may have one, or more, of the toolsets from the Gartner Quadrant mentioned earlier.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB">We should probably try as many of the appropriate candidate models as time allows. Some â€“ particularly those that come from classical statistics (see the earlier point) â€“ may not be appropriate because of the shape of the data so may be rule out. Going in, especially with new data, it is usually difficult to know which type of model will give us the best predictions  . From experience analysts may like to start with methods they know have produced the best models with what feels like similar data.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB"><strong>So what is a model?<br />
</strong></span><span lang="EN-GB">The different types of algorithms construct models in different styles but at the most abstract level a model defines a pattern, or relationship, between the input variables and the output (outcome) variables. A [<strong>S</strong></span><span lang="EN-GB"><strong>tatistical</strong>] regression model, for example, will use a mathematical formula to achieve this. A <strong>Decision Tree/Rule induction</strong> model will produce a tree or a set of rules to characterise the relationship. Whereas a <strong>Neural Network</strong> model will typically build a more opaque view of the relationships by connecting an abstract network of nodes, links and weights to encapsulate the underlying pattern.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB"><strong>The core train/test process<br />
</strong></span><span lang="EN-GB">One of the beauties of predictive analytics is the way in which we construct a simple experimental structure which allows us to test (validate) models on unseen data. The empirical approach, if it is done properly, gives us a pretty good approximation to how the models will perform when deployed in a live setting on new data. For example, letâ€™s say we have a data set from a period in time when we know which customers churned or stayed. We would typically model a customerâ€™s likelihood to churn on a subset (60% say) of that data  and then test it on the other 40% to see how well the model predicts churn. If the accuracy is good enough (and that depends on the success criteria that we defined) then â€¦ if all other things are equal and we had constructed a representative enough data mining table â€¦ then we would expect similar results if we use the model going forward in a live setting. </span><span lang="EN-GB" /><span lang="EN-GB">Usually this means that we randomly split the data into two subsets</span><span lang="EN-GB" /></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB" /></p>
<ul>
<li>
<div><span lang="EN-GB">The training subset is the one use to build the model (the 60% in the churn modelling scenario described above).<br />
</span></div>
</li>
<li>
<div><span lang="EN-GB">The testing subset is the one we use to evaluate the model (the 40% for the above scenario). This second set is used to effectively simulate what we want to do in practice (when we deploy); that is to use our model to accurately predict the outcome(s) of interest.</span></div>
</li>
</ul>
<p><span lang="EN-GB">We do this because the true test of a model is not how well it can predict the outcome when it knows it (which is what it does with the training subset). Rather how well can it predict the outcome when it doesnâ€™t know what the outcome is.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB"><strong>So how good is my model (really)?<br />
</strong></span><span lang="EN-GB">Until now we have only considered how accurate the model is by considering what percentage of the time it gets the prediction right e.g. predict churners. In practice of course this is only part of the evaluation process. We may find, for example, that our model is good at finding low value fraud (of which there is likely to be more and hence our overall percentage prediction) is higher â€¦ but that the more valuable transactions which hurt us more are missed. One way to address this could be to focus on (e.g. create a subset which focuses on the valuable minority while still being sufficiently representative to be deployable). Either way our evaluation of candidate models, and hence the models we might continue to develop and refine, should be led by model evaluations which include all the factors that we really care about. These are often around the cost/benefit of the actions that the model would have us take in the field to act on its predictions. This is where more involved simulations enable us to make more meaningful assessments of the future impact of a model.</span></p>
<p><span lang="EN-GB" /><span lang="EN-GB" /><span lang="EN-GB">Next we will take a real life example to better illustrate how this step can work in practiceâ€¦<br />
</span>
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2007/03/15/how-to-do-predictive-analytics-part-5/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Web analytics consolidation in Europe</title>
		<link>http://www.applied-insights.co.uk/news/2007/01/19/web-analytics-consolidation-in-europe/</link>
		<comments>http://www.applied-insights.co.uk/news/2007/01/19/web-analytics-consolidation-in-europe/#comments</comments>
		<pubDate>Fri, 19 Jan 2007 11:05:18 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
	<category>Web analytics</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2007/01/19/web-analytics-consolidation-in-europe/</guid>
		<description><![CDATA[Yesterday Omniture announced their acquisition of Instadia the Scandinavian web analytics company. This will probably explain why I have had trouble getting hold of senior executives from Instadia and Omniture Europe over the last week! Coincidentally I had been approaching both companies to see if they would take part in a &#8220;blogchat&#8221; over the coming [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday Omniture <a title="Omniture press release" target="_blank" href="http://www.omniture.com/instadia">announced their acquisition of Instadia</a> the Scandinavian web analytics company. This will probably explain why I have had trouble getting hold of senior executives from Instadia and Omniture Europe over the last week! Coincidentally I had been approaching both companies to see if they would take part in a &#8220;<a title="Blogchat" href="http://www.applied-insights.co.uk/news/category/blogchat/">blogchat</a>&#8221; over the coming weeks.</p>
<p>I think this is an interesting acquisition and perhaps not wholly unexpected. I often wonder about the future of European vendors in this space and what the long term prospects are for companies such as Speedtrap, Site Intelligence, Red Eye and others. The larger European vendors all have some distinctive feature to them such as a particular data collection methodology or visualisation tools, or they have a strong penetration within certain markets or channels.</p>
<p>With Instadia it looks like a bit of both. Obviously they had a strong presence in Scandinavia and were beginning to make moves into other markets such as the UK and it looks like this was a good fit for Omniture. However, I have had the opportunity over the past few months to take a good look at Instadia and there are some neat aspects to their technology as well which will hopefully manifest themselves in SiteCatayst at some point in the future.</p>
<p>One of the unique things about the Instadia product is their integrated survey tool. As you may know from other articles and blogs I&#8217;ve written I&#8217;m a big fan of using multiple sources of data including surveys to fully evaluate the performance of the online channel. A number of the web analytic tools can now &#8220;integrate&#8221; survey data and aloow users to do some basic filtering of the data based on survey responses. However, the Instadia tool has a survey tool built into it so that you can design, script and launch the survey from within the system itself and the data is fully integrated at the visitor level with the site behavioural data. I think this is very powerful.</p>
<p>If I had a small gripe about Instadia it was that I thought that the interface was a bit clunky. But I also tend to segment systems between tools that are predominantly reporting orientated and tools that are truly analytical. As a sweeping generalisation the latter group usually have clunky interfaces. It&#8217;s the nature of the beast. I think Instadia falls into &#8220;analytical&#8221; camp and the other feature that it has that I like is its &#8220;Filter Builder&#8221;. This is a visual way of creating filters and segments and it enables quite complex filters to be built very easily. It&#8217;s one of those neat ideas that works very well.</p>
<p>So I think that Omniture have picked up a little gem of a company with Instadia. From what I know of the two companies it should be a good fit. Good luck to Anders and the rest of the Instadia team. I hope that we see some of the more distinct Instadia features on the Omniture roadmap at some point. It would shame to loose them from the space.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2007/01/19/web-analytics-consolidation-in-europe/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>How to do Predictive Analytics - Part 4</title>
		<link>http://www.applied-insights.co.uk/news/2007/01/09/how-to-do-predictive-analytics-part-4/</link>
		<comments>http://www.applied-insights.co.uk/news/2007/01/09/how-to-do-predictive-analytics-part-4/#comments</comments>
		<pubDate>Tue, 09 Jan 2007 11:05:46 +0000</pubDate>
		<dc:creator>John McConnell</dc:creator>
		
	<category>Applied Insights Blog</category>
	<category>Segmentation</category>
	<category>Predictive analytics</category>
	<category>Data mining</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2007/01/09/how-to-do-predictive-analytics-part-4/</guid>
		<description><![CDATA[Step 3 â€“ Data Preparation
Anyone who has ever analysed data knows what a nuisance it can be. Whenever we want to analyse it in a new style we often have to manipulate in some way before we can do so. The more &#8220;raw&#8221; the data, or the more fundamentally different the analysis, the more work [...]]]></description>
			<content:encoded><![CDATA[<p><strong><span lang="EN-GB">Step 3 â€“ Data Preparation<br />
</span></strong><span lang="EN-GB">Anyone who has ever analysed data knows what a nuisance it can be. Whenever we want to analyse it in a new style we often have to manipulate in some way before we can do so. The more &#8220;raw&#8221; the data, or the more fundamentally different the analysis, the more work we typically have to do to get into the shape we need for the analysis we want to perform.</span></p>
<p><span lang="EN-GB"><br />
<span lang="EN-GB" /></span><span lang="EN-GB">As I mentioned in an earlier blog; if the primary data source is a <a href="http://en.wikipedia.org/wiki/Data_warehouse" target="_blank">data warehouse</a> which contains well structured, rigorously cleaned and de-duped data, then this is usually the best starting point. But it is only that. The shape of the data tables in the warehouse will inevitably have been defined with a certain type of analysis in mind; most often to produce the standard business intelligence style of reports. You might get lucky and find you can use that data as-is for the kinds of predictive analysis you have in mind. The chances are that you wonâ€™t, and that you will have to re-structure the data in preparation for that analysis.<br />
</span><span lang="EN-GB" /><span lang="EN-GB">Furthermore it may well contain aggregated data, perhaps an <a href="http://en.wikipedia.org/wiki/OLAP" target="_blank">OLAP</a> structure of some sorts, which may allow you to produce time series forecasts but which will most likely contain data which is too summarised for most other kinds of predictive analysis. If this is the case then youâ€™ll probably need to go back and locate the sources of the summary data. That might not be a trivial exercise.</span></p>
<p><strong><span lang="EN-GB">How did we get here?<br />
</span></strong><span lang="EN-GB">In the previous steps, discussed in other blog entries, we effectively designed the analyses which we intend to perform at the next step; Data Modelling. In <a href="http://www.applied-insights.co.uk/news/2006/11/22/how-to-do-predictive-analytics-part-3/" target="_blank">Data Understanding</a> we learnt all about the existing data structures, formats and sources and we started to look for patterns in those sources which are pertinent to the analytical objectives we defined at the <a href="http://www.applied-insights.co.uk/news/2006/11/01/how-to-do-predictive-analytics-part-2/" target="_blank">start of the process</a>. The truth is that, to perform the exploration, we would have had to prepare the data to some extent. But this is the point where we get serious and apply the necessary data management steps to get the data into the shape(s) required for the main task; predictive modelling.<br />
</span><span lang="EN-GB" /><span lang="EN-GB">At the top level this means we end up doing one or more of the following:</span></p>
<ul>
<li><span lang="EN-GB"><strong>Cleaning data<br />
</strong></span><span lang="EN-GB">This may not be necessary depending on how &#8220;clean&#8221; the original source is (though it is not unusual to find data problems when we start to analyse it in an unfamiliar way). Our previous exploration should have revealed any errors, or inconsistencies, which need to be corrected, or excluded.</span></li>
<li><span lang="EN-GB"><strong>Merging data from multiple sources<br />
</strong></span><span lang="EN-GB">If you are lucky the data will be in a single data file, or a single table in a database. If you are unlucky it will be in a variety of disparate sources with different formats in various locations</span></li>
<li><span lang="EN-GB"><strong>Shaping it for the analysis<br />
</strong></span><span lang="EN-GB">Often the most time consuming element. A classic example is where we have data with a sequence to it; typical if we are looking to predict the likelihood of a an event given a set of previous events. The starting point is typically data in a database which often contains all event transactions. In order to model it in a way which mimics how we will look to apply (deploy) the model we have to define an appropriate point in history as the baseline, e.g. if we are interested to know what will happen after March 2007 we might use March 2006 as that anchor point. We then have to restructure the incoming data to derive all the interesting predictors e.g. transaction frequency, transaction value in previous months, years, etc. from March 2006 backwards. We also need to have a separate data partition which contains the &#8220;what happened next&#8221; data for a period after March 2006 that corresponds to the period weÂ want to predict into in 2007; so if we are interested to see which customers are likely to churn in April 2007, then April 2006 is likely to be the best month to look at it 2006. NB. Modelling and Evaluation (see later) will help test that hypothesis.</span></li>
<li><span lang="EN-GB"><strong>Deriving new data elements</strong><br />
</span><span lang="EN-GB">Typically new fields(variables). In our exploration, for example, we may have found that there appears to be a strong relationship between the rate at which a customer buys products and the likelihood that they will churn. In many cases that rate will not exist as a separate measure in the current data, so we create it in this step.</span></li>
<li><span lang="EN-GB"><strong>Describing it</strong><br />
</span><span lang="EN-GB">Labelling, formatting and generally documenting the data in a way which helps the analyst, or other viewers of the data, to understand its meaning.<br />
</span><span lang="EN-GB" /><span lang="EN-GB">The outcome of the above is a set of tables, or data files, which are in the shape we believe we need for the modelling effort we have in mind.</span></li>
</ul>
<p><strong><span lang="EN-GB">You [almost always] never get it right first time<br />
</span></strong><span lang="EN-GB">Weâ€™ve mentioned it before but it is worth re-stating that much of the CRISP process is iterative. Quite often we will get into the modelling step, for example, and discover a potential relationship that looks interesting but which we have to go back to the preparation process to derive. Frequently, because we are often building complex data handling processes from scratch, we just make mistakes which need to be corrected.<br />
</span><span lang="EN-GB" /><span lang="EN-GB">With large datasets the preparation time can be significant; It can take hours â€¦ sometimes days, so mistakes and re-runs can be costly. Hence wherever possible it is a good idea to test the process using data subsets, ideally random, or at least representative, samples. Samples can also be used to boost productivity when we get into the analysisâ€¦more on that next time.</span></p>
<p><strong><span lang="EN-GB">An</span></strong><span lang="EN-GB"> <strong>example</strong><br />
</span><span lang="EN-GB">Data collected in the web channel is a great illustration of this point. We work with a lot of this kind of data typically for web sites with large numbers of visitors; usually millions per week. These sites inevitably have a web analytics tool which they use to analyse key metrics of site performance. Most often we are interested to apply predictive and/or segmentation methods to the site data. This typically involves:</span></p>
<ul>
<li><span lang="EN-GB">Extracting behavioural data from the data warehouse (underlying the web analytics tool) or via a data feed that the analytics vendor provides. More often than not we extract this data to a number of text format files.</span></li>
<li><span lang="EN-GB" /><span lang="EN-GB">For our <a href="http://www.applied-insights.co.uk/our-services/web-analytics/customer-journey-analysis/" target="_blank">Customer Journey Framework</a> we usually have an additional data source in the form of on-line surveys. Depending on the analytics tool that the client is using we have developed a number of ways of linking the data that the visitor provides as a respondent in the survey to the behavioural data which maps that visitors journey through the site.</span></li>
<li><span lang="EN-GB" /><span lang="EN-GB">The data we end up with can be at various levels but more often than not it is at the individual page or individual click level (remember these sites have millions of visitors so the number of records gets multiplied up). We take this data and aggregate it over a period of time to end up with tables for analysis which are at the visit and/or visitor level. Each of the resulting records will contain fields of interest; e.g. site content viewed, visit intentions and conversion goals which we will use for analysis.</span></li>
</ul>
<p><span lang="EN-GB" /><span lang="EN-GB">For a typical site processing a weeks worth of data into the shape needed for analysis can take 4-6 hours.</span></p>
<p><strong><span lang="EN-GB">Which tools?<br />
</span></strong><span lang="EN-GB">As is often the case the choice of tool for data management comes down to those that the analyst/data is familiar with. Database tools are all about this type of work and often the best approach is to aim to construct data mining tables inside a relational database. This can be achieved using a combination of SQL, ETL tools and other database utilities.<br />
</span><span lang="EN-GB">Generally speaking; the more sophisticated the predictive tool itself the greater the data management capabilities which are built in. So SPSS, SPSS Clementine, SAS and SAS Enterprise Miner offer a broad range of data handling procedures.</span></p>
<p><span lang="EN-GB"><strong>So much for progress</strong>Even though we have more and better tools, and faster hardware,Â with which to manipulate data these days this is offset by the increasing volume of data, complexity of structures and number of sources. Hence the old adage that data management consumes more of a data analysis effort than the analysis itself typically holds as much today as it ever did. But it is a necessary pain to get us to the point where we can get to the next step which is at the core of the predictive process; Data Modelling.<br />
</span>
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2007/01/09/how-to-do-predictive-analytics-part-4/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Other things to think about when selecting a web analytics system</title>
		<link>http://www.applied-insights.co.uk/news/2006/12/08/other-things-to-think-about-when-selecting-a-web-analytics-system/</link>
		<comments>http://www.applied-insights.co.uk/news/2006/12/08/other-things-to-think-about-when-selecting-a-web-analytics-system/#comments</comments>
		<pubDate>Fri, 08 Dec 2006 13:50:26 +0000</pubDate>
		<dc:creator>Neil Mason</dc:creator>
		
	<category>Applied Insights Blog</category>
	<category>Analytics strategy</category>
	<category>Web analytics</category>
		<guid isPermaLink="false">http://www.applied-insights.co.uk/news/2006/12/08/other-things-to-think-about-when-selecting-a-web-analytics-system/</guid>
		<description><![CDATA[In various articles and other places, I have shared thoughts about how to go about selecting your web analytics system. I believe it&#8217;s important to have a process but not to be over procedural about the whole thing. I think it&#8217;s a good idea to be very clear about what you want to measure rather [...]]]></description>
			<content:encoded><![CDATA[<p>In various articles and other places, I have shared thoughts about how to go about selecting your web analytics system. I believe it&#8217;s important to have a process but not to be over procedural about the whole thing. I think it&#8217;s a good idea to be very clear about what you want to measure rather than to become a victim of paralysis by analysis or &#8220;death by report overload&#8221;.</p>
<p>I&#8217;m also thinking that there are some other issues that you should consider when thinking about which system is best for you, particularly in the UK at the moment. It&#8217;s nothing to do with KPIs, funnels and the like. It&#8217;s about &#8220;fit&#8221; and &#8220;partnership&#8221;.</p>
<p>Let&#8217;s have a look at what&#8217;s going on at the moment. In the US, the web analytics systems market is beginning to mature. Most companies who are doing any serious business online have already invested in a web analytics system. In fact they may be on their third or fourth. As the online business has become more important, the web analytics system has become a bit more &#8220;mission critical&#8221;. Efforts have begun to integrate the web analytics system with other corporate systems. So, the cost and pain of switching from one system to another is getting higher and higher. The net result, as I see it, is that in the US, more companies have a system in place and organisations are becoming increasingly reluctant to switch and so growth opportunities for the vendors are diminishing.</p>
<p>This is not the case over here in the UK and the rest of Europe. Many organisations are still in the process of sorting out their online channel measurement strategies and are looking to upgrade their capabilities. Across Europe existing localised solutions are being centralised to help manage costs and enable consistency of measurement across markets. There is a lot of &#8220;buzz&#8221; in the market and the various vendors are busy responding to proposal requests, doing pitches and making the sale.</p>
<p>This is all great but the trouble is there aren&#8217;t enough people around to satisfy the current demand and I think that this is having an impact on service levels in the industry. Going back to my comments about what else to look out for when choosing a vendor, most of the main systems these days are basically functionally equivalent. One system might have some &#8220;bells&#8221; and another one might have some &#8220;whistles&#8221;. Or you might prefer the way that one system does its conversion funnel over that of another. These may be big enough differences from which to make a decision but they also may not. So how else can you decide? Well, it might be on price or it might be on service.</p>
<p>If you&#8217;re making serious investments in your web analytics capabilities, both in the system you&#8217;re buying and the resources to manage it, then I think you need to look at the kind of service and support you can expect <em>after </em>the salesperson has done the deal. I don&#8217;t think that&#8217;s it just about whether they have extensive online support capabilities or whether they have a department called &#8220;client services&#8221; or &#8220;best practices&#8221;, it&#8217;s more about whether they seem interested in you as a business rather than you as a sale. Ask to meet the people who are going to be managing your account before you sign the deal and see what you make of them. Are they people you want to do business with?</p>
<p>For all the reasons that I mentioned earlier about the current market situation in the US, if you are going through a vendor selection programme now, you don&#8217;t want to find yourself in the positron of wanting to make a switch in 18 months time. So it&#8217;s important to get it right this time round and that means not only making sure that the system is functionally fit-for-purpose but also that the vendor is a company that you feel comfortable working with for a number of years.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.applied-insights.co.uk/news/2006/12/08/other-things-to-think-about-when-selecting-a-web-analytics-system/feed/</wfw:commentRSS>
		</item>
	</channel>
</rss>
