<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.3.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>BI-Nerd.com</title>
	<link>http://www.bi-nerd.com</link>
	<description></description>
	<pubDate>Thu, 09 Oct 2008 13:18:21 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.3</generator>
	<language>en</language>
			<item>
		<title>Ab Initio - The Origins Of Their Secrecy</title>
		<link>http://www.bi-nerd.com/ab-initio-the-origins-of-their-secrecy/</link>
		<comments>http://www.bi-nerd.com/ab-initio-the-origins-of-their-secrecy/#comments</comments>
		<pubDate>Sat, 04 Oct 2008 02:46:33 +0000</pubDate>
		<dc:creator>Thomas</dc:creator>
		
		<category><![CDATA[Ab Initio]]></category>

		<category><![CDATA[Editorial]]></category>

		<guid isPermaLink="false">http://www.bi-nerd.com/ab-initio-the-origins-of-their-secrecy/</guid>
		<description><![CDATA[Ah&#8230; Ab Initio. The mystery guest with the cool leather jacket at the salsa party of ETL tools, who doesn&#8217;t say much but always ends up attracting all the girls&#8230; If only because all the other guests have nerdy glasses and keep trying to explain the virtues of enterprise-wide data synchronization.
They are secretive. They are suave. They are a mystery.
Call it a [...]]]></description>
			<content:encoded><![CDATA[<p align="justify"><a href="http://www.bi-nerd.com/wp-content/tmc.gif" title="Thinking Machines Cup"></a><a href="http://www.bi-nerd.com/wp-content/cm5_lg.jpg" title="CM-5"><img src="http://www.bi-nerd.com/wp-content/cm5_lg.thumbnail.jpg" alt="CM-5" /></a>Ah&#8230; <a href="http://www.abinitio.com/" title="company website">Ab Initio</a>. The <a href="http://www.bi-nerd.com/ab-initio-the-dark-horse-of-etl/" title="some more to read">mystery guest</a> with the cool leather jacket at the salsa party of ETL tools, who doesn&#8217;t say much but always ends up attracting all the girls&#8230; If only because all the other guests have nerdy glasses and keep trying to explain the virtues of enterprise-wide data synchronization.</p>
<p align="justify">They are secretive. They are suave. They are a mystery.</p>
<p align="justify">Call it a hobby. A company that acts like it is operating in a world of cold war spy agencies rather than the slightly dry and uneventful theatre of database extraction is almost bound to make one curious. On one forum I recently saw a consultant wail with indignation, saying they would not even respond to a potential sale before he signed a non-disclosure agreement. On another I read a researcher declare that he would be excluding them from his market overview for the very same reason. They won&#8217;t talk, they won&#8217;t show, and if you <em>do talk </em>about them there is always the possibilty that there will be men in matching raincoats coming to have a friendly, private, and potentially terminal chat with you. One cannot but stop and wonder: why? Why, for goodness&#8217; sake, would a company doing business in a competitive market want to act this way? What&#8217;s the point of it?</p>
<p align="justify">As it turns, there really <em>is </em>a reason. Or at least an explanation. I was most pleased when I found this article, titled &#8220;<a href="http://www.inc.com/magazine/19950915/2622_pagen_3.html" title="The Rise and Fall of Thinking Machines">The Rise and Fall of Thinking Machines</a>&#8221; from 1995. It makes for good reading.</p>
<p align="justify"> <a href="http://www.bi-nerd.com/ab-initio-the-origins-of-their-secrecy/#more-66" class="more-link">(more&#8230;)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bi-nerd.com/ab-initio-the-origins-of-their-secrecy/feed/</wfw:commentRss>
		</item>
		<item>
		<title>BI-Research : Factors influencing the FIT between BI End-user and BI-Solution</title>
		<link>http://www.bi-nerd.com/bi-research-factors-influencing-the-fit-between-bi-end-user-and-bi-solution/</link>
		<comments>http://www.bi-nerd.com/bi-research-factors-influencing-the-fit-between-bi-end-user-and-bi-solution/#comments</comments>
		<pubDate>Tue, 12 Aug 2008 14:47:19 +0000</pubDate>
		<dc:creator>hessel</dc:creator>
		
		<category><![CDATA[Editorial]]></category>

		<category><![CDATA[Various]]></category>

		<guid isPermaLink="false">http://www.bi-nerd.com/bi-research-factors-influencing-the-fit-between-bi-end-user-and-bi-solution/</guid>
		<description><![CDATA[Do you know your BI End-user(s)? Are you wondering why your BI-applications are barely used? Do you think technology is enough? Do you want to want to get maximum benefit from your BI investment(s)? During the next six months a scientific research project is conducted at Capgemini by a graduate student of Utrecht University, the [...]]]></description>
			<content:encoded><![CDATA[<p>Do you know your BI End-user(s)? Are you wondering why your BI-applications are barely used? Do you think technology is enough? Do you want to want to get maximum benefit from your BI investment(s)? During the next six months a scientific research project is conducted at Capgemini by a graduate student of Utrecht University, the Netherlands to get an answer to these questions. The (main) goal of this research is to obtain insight into the factors influencing the fit between Business Intelligence Solution and End-user.  </p>
<p> <a href="http://www.bi-nerd.com/bi-research-factors-influencing-the-fit-between-bi-end-user-and-bi-solution/#more-65" class="more-link">(more&#8230;)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bi-nerd.com/bi-research-factors-influencing-the-fit-between-bi-end-user-and-bi-solution/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The ETL Generator - A Holy Grail  (or is it?)</title>
		<link>http://www.bi-nerd.com/the-etl-generator-a-holy-grail-or-is-it/</link>
		<comments>http://www.bi-nerd.com/the-etl-generator-a-holy-grail-or-is-it/#comments</comments>
		<pubDate>Thu, 26 Jun 2008 12:45:50 +0000</pubDate>
		<dc:creator>Thomas</dc:creator>
		
		<category><![CDATA[Data Warehousing]]></category>

		<category><![CDATA[Super Fancy Sexy]]></category>

		<category><![CDATA[Trends]]></category>

		<guid isPermaLink="false">http://www.bi-nerd.com/the-etl-generator-a-holy-grail-or-is-it/</guid>
		<description><![CDATA[For some reason I can&#8217;t quite fathom myself either, I spend considerable amounts of time in all manner of datawarehouse / ETL related product development groups. (A substantial bit of that time, I should add, being my own; major system integrators put great value in R&#38;D - just as long as it doesn&#8217;t eat into the billable hours.) And whether they [...]]]></description>
			<content:encoded><![CDATA[<p align="justify"><a href="http://www.bi-nerd.com/wp-content/bart-simpson-generator.gif" title="Auto Generated"><img align="baseline" width="128" src="http://www.bi-nerd.com/wp-content/bart-simpson-generator.thumbnail.gif" alt="Auto Generated" height="68" style="width: 128px; height: 68px" /></a>For some reason I can&#8217;t quite fathom myself either, I spend considerable amounts of time in all manner of datawarehouse / ETL related product development groups. (A substantial bit of that time, I should add, being my own; major system integrators put great value in R&amp;D - just as long as it doesn&#8217;t eat into the billable hours.) And whether they called themselves Focus Groups, or Special Interest Groups, or Knowledge Groups, what they basically are all looking for is the answer to one simple question: how can we do what we do already, only better? And of course when we say better mostly we mean faster and, above all, cheaper. If your project leaders are anywhere near as sharky as mine, you&#8217;ll find that sooner or later you have to come up with a good reason why this ETL stuff is taking so bloody long.  </p>
<p align="justify">Not that it is unfair of them to ask. I mean, let&#8217;s face it: the Business Intelligence projects that we work on generally come with price tags that would enable you or me to spend the rest of our lives on pearly white beaches, were the money to be deposited on our bank accounts directly. And while we&#8217;re at it, let&#8217;s double face it: a substantial amount of that money - up to 75 % I&#8217;ve heard mention - is spent not on the reports that are the end result, but on building the data warehouse and the ETL that is used to fill it with juicy numbers. And to top it all off, let&#8217;s triple face it: a whole lot of those data mappings we were building in that time really weren&#8217;t all that complicated. So the question, at least, is a valid one.</p>
<p align="justify"> <a href="http://www.bi-nerd.com/the-etl-generator-a-holy-grail-or-is-it/#more-63" class="more-link">(more&#8230;)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bi-nerd.com/the-etl-generator-a-holy-grail-or-is-it/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Profiling your data</title>
		<link>http://www.bi-nerd.com/profiling-your-data/</link>
		<comments>http://www.bi-nerd.com/profiling-your-data/#comments</comments>
		<pubDate>Sun, 22 Jun 2008 15:08:34 +0000</pubDate>
		<dc:creator>tom</dc:creator>
		
		<category><![CDATA[Analysis]]></category>

		<guid isPermaLink="false">http://www.bi-nerd.com/profiling-your-data/</guid>
		<description><![CDATA[Profiling the data can be described as getting an overview of the data. This overview gives you an indication on the maximum, the average, lowest figure, the highest, number of distincts etc. From such an overview, one is able to know if the data comply to certain business rules. One also gets a hunch of [...]]]></description>
			<content:encoded><![CDATA[<p>Profiling the data can be described as getting an overview of the data. This overview gives you an indication on the maximum, the average, lowest figure, the highest, number of distincts etc. From such an overview, one is able to know if the data comply to certain business rules. One also gets a hunch of the data: if we have amounts, is it expressed in cents or euros. Or if weights are involved, do we have a weight that is expressed in gramms, kilos or tons.</p>
<p>A mathematician considers profiling the data as an easy job: only key statistics (frequencies, averages) are derived and no high level mathematics is involved. I remember the face of a mathematician, who after having explained what data profiling was, asked the question: what the hell are talking about? Data profiling is of more interest to the analyst, who wants to know if the data comply to buisiness rules and who would like to get more acquintance with the data.</p>
<p>Data profiling is mostly done during the analysis of an ETL project when data are to derived from sourcing systems and to be put in a data warehouse. Before building starts, we need to know the likely content of the data. Based on this knowledge, one could establish a dataflow that takes into account the particuliarities of the data. Example: if we know from the analysis that dataes of birth are subject to human error, we might include a check to see if the date of birth is within an acceptable range.</p>
<p>Previously such analysis was done thru a series of queries, in order to calculate the key statistics for every attribute in the database. Nowadays alternatives haven arisen that make life less complicated. I will mention 3 such alternatives:</p>
<p>In the Oracle Warehouse Builder, a wizard exists that creates a nice overview for an external table that is imported in OWB.</p>
<p>The SQL Server 2008 also includes a wizard to do the data profiling.</p>
<p>Lastly freeware tool exists that can be downloaded from &#8220;arrah technologies&#8221;. It has a fully functional version that can be used for a limited period. After that, it should be  bought. To install, one should have the exact version of java that is required (1.5.0_09). Also, the way to get it started is somewhat tricky: it will start with &#8220;java Profiler&#8221;, which is not documented. In their tutorial, they showed a different command line, which did not get it running. Finally the data base connection should be set manually, which can also be tedious. But after all that, we have an intuive profiling tool that does what it should do: give insight into the data by means of key statistics.</p>
<p>In this article, I used much information from Henk Jan te Brake. Thanks to him; errors in this short overview are mine!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bi-nerd.com/profiling-your-data/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Informatica != SSIS (part 1)</title>
		<link>http://www.bi-nerd.com/informatica-ssis-part-1/</link>
		<comments>http://www.bi-nerd.com/informatica-ssis-part-1/#comments</comments>
		<pubDate>Thu, 22 May 2008 10:11:17 +0000</pubDate>
		<dc:creator>Bram@BI-Nerd.com</dc:creator>
		
		<category><![CDATA[Informatica]]></category>

		<category><![CDATA[Microsoft]]></category>

		<category><![CDATA[Tips 'n Tricks]]></category>

		<guid isPermaLink="false">http://www.bi-nerd.com/informatica-ssis-part-1/</guid>
		<description><![CDATA[Hopping from assignment to assignment (or continent to continent for some of you) you&#8217;re bound to work with various pieces of software. From Cognos to Business Objects, from Oracle to MS SQL server, or from Informatica to Microsofts SSIS. It is &#8217;stating the obvious&#8217; to say that all those programs -despite having similar goals and [...]]]></description>
			<content:encoded><![CDATA[<p>Hopping from assignment to assignment (or continent to continent for some of you) you&#8217;re bound to work with various pieces of software. From Cognos to Business Objects, from Oracle to MS SQL server, or from Informatica to Microsofts SSIS. It is &#8217;stating the obvious&#8217; to say that all those programs -despite having similar goals and functionality- work quite different. This article will delve deeper into some of the peculiar differences one might run into when going from Informatica to SSIS (based on my experience).</p>
<p> <a href="http://www.bi-nerd.com/informatica-ssis-part-1/#more-61" class="more-link">(more&#8230;)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bi-nerd.com/informatica-ssis-part-1/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
