LWN: Comments on "Data Warehousing 101" https://lwn.net/Articles/452307/ This is a special feed containing comments posted to the individual LWN article titled "Data Warehousing 101". en-us Thu, 02 Oct 2025 11:43:11 +0000 Thu, 02 Oct 2025 11:43:11 +0000 https://www.rssboard.org/rss-specification lwn@lwn.net Refreshing overview https://lwn.net/Articles/454291/ https://lwn.net/Articles/454291/ lab <div class="FormattedComment"> Can only second that! Very informative article. Thanks Josh.<br> </div> Sat, 06 Aug 2011 09:36:40 +0000 Data Warehousing 101 https://lwn.net/Articles/454033/ https://lwn.net/Articles/454033/ Lennie <div class="FormattedComment"> Have you considered a "noSQL" solution like ?: <a href="http://couchdb.apache.org/docs/intro.html">http://couchdb.apache.org/docs/intro.html</a> <a href="http://couchdb.apache.org/docs/overview.html">http://couchdb.apache.org/docs/overview.html</a><br> </div> Thu, 04 Aug 2011 00:12:17 +0000 Refreshing overview https://lwn.net/Articles/453568/ https://lwn.net/Articles/453568/ mathstuf <div class="FormattedComment"> Better: Make buttons for +1 and -1. Then have the ability to get git --diffstat-like output for a thread.<br> </div> Sat, 30 Jul 2011 21:46:04 +0000 Refreshing overview https://lwn.net/Articles/453496/ https://lwn.net/Articles/453496/ oak <div class="FormattedComment"> Maybe the LWN commenting system should add "+1" comments as "+" characters to the header or footer of the previous post instead?<br> <p> </div> Sat, 30 Jul 2011 10:06:19 +0000 Queuing platforms for ETL? https://lwn.net/Articles/453197/ https://lwn.net/Articles/453197/ gramosalvarez <div class="FormattedComment"> Great article, indeed.<br> Still, I don't see where ActiveMQ or RabbitMQ stand as alternatives for ETL tools. <br> Could you elaborate on this, please?<br> Thanks!<br> <p> </div> Thu, 28 Jul 2011 11:06:01 +0000 Data Warehousing 101 https://lwn.net/Articles/453152/ https://lwn.net/Articles/453152/ jberkus <div class="FormattedComment"> KVAML,<br> <p> Huh. See, every time somebody thinks they've invented new DB technology ...<br> <p> ;-)<br> </div> Thu, 28 Jul 2011 01:37:14 +0000 KDB https://lwn.net/Articles/452964/ https://lwn.net/Articles/452964/ Jaquatech <div class="FormattedComment"> <a rel="nofollow" href="http://kx.com/products.php">http://kx.com/products.php</a> sounds like it<br> -Jan<br> </div> Tue, 26 Jul 2011 16:29:58 +0000 Data Warehousing 101 https://lwn.net/Articles/452766/ https://lwn.net/Articles/452766/ kvaml <div class="FormattedComment"> "Column-store databases (were) Invented in 1999"<br> <p> Actually the old Dartmouth Time Sharing System (DTSS) had a column store database known as project FIND in the 1970's.<br> </div> Sun, 24 Jul 2011 22:06:26 +0000 Data Warehousing 101 https://lwn.net/Articles/452665/ https://lwn.net/Articles/452665/ gerv <div class="FormattedComment"> Just the sort of thing I subscribe to LWN for. Great :-)<br> <p> Gerv<br> </div> Sat, 23 Jul 2011 05:50:12 +0000 Refreshing overview https://lwn.net/Articles/452650/ https://lwn.net/Articles/452650/ ballombe <div class="FormattedComment"> Well, some people take that as a hint that +1 are not informative enough to warrant a comment.<br> </div> Fri, 22 Jul 2011 22:34:31 +0000 KDB https://lwn.net/Articles/452610/ https://lwn.net/Articles/452610/ jberkus <div class="FormattedComment"> Macson,<br> <p> Link? That's not one I've heard of before.<br> </div> Fri, 22 Jul 2011 17:12:55 +0000 KDB https://lwn.net/Articles/452559/ https://lwn.net/Articles/452559/ macson_g <div class="FormattedComment"> Interesting article!<br> <p> I'd like to add KDB to to the list; its ubiquitous in the finance industry. <br> <p> KDB is a C-store database with embedded Q interpreter. Q is a powerful and flexible programming language and having it running at the core of the DB allows for complex statistical analysis to be executed fast even on very large datasets (think terabytes of data and billions of records).<br> </div> Fri, 22 Jul 2011 09:35:09 +0000 Data Warehousing 101 https://lwn.net/Articles/452477/ https://lwn.net/Articles/452477/ jeremiah <div class="FormattedComment"> Except for the languages listed, that's what I've got. I found Neo4J difficult because of the lack of flexibility in defining relationships, ie they couldn't be dynamic. At times I even toy with the idea of doing something on my own at the file system level, but it's just not worth the effort, esp trying to redo a lot of the features that the Postgres folks have worked so hard over the years to develop. And I'm sure I could never approach their level of scalability on my own. Thanks for the input. <br> </div> Thu, 21 Jul 2011 20:28:39 +0000 Data Warehousing 101 https://lwn.net/Articles/452469/ https://lwn.net/Articles/452469/ jberkus <div class="FormattedComment"> Jeremiah,<br> <p> If only there were! There's a few different implementations of graph databases, of which Neo4J is probably the leader, but every one I've seen is hard to use and has some substantial scalability limitations.<br> <p> I think Postgres &amp; WITH RECURSIVE, plus pl/perl or pl/python stored procedures for the decomposition may be your current best bet.<br> </div> Thu, 21 Jul 2011 19:41:09 +0000 Refreshing overview https://lwn.net/Articles/452466/ https://lwn.net/Articles/452466/ louie <div class="FormattedComment"> +1.<br> <p> (Esteemed Editor, your comment system's "less than ten characters" rule should have an exception for +1s ;)<br> </div> Thu, 21 Jul 2011 19:27:05 +0000 Data Warehousing 101 https://lwn.net/Articles/452447/ https://lwn.net/Articles/452447/ jeremiah <div class="FormattedComment"> Nice article. Now if I could find some kind of DB that's good at storing deconstructed XML or JSON with version information I'd be a happy camper. I'm always drawn to map-reduce, but always find myself using going back to RDBMS w/ recursive queries. <br> <p> I've got a situation where every attribute and element is version-ed and stored separately. Everything is treated as a node, and the relationships between nodes are defined by the query. Needless to say, it's a little trippy. Or to say it another way, we can store to different directed graphs, and define the structure of a third, and have the DB/code fill in the values of the 3rd, using the first two. <br> <p> Anyone know of some kind of db architecture that would be inherently good at this?<br> </div> Thu, 21 Jul 2011 17:37:06 +0000 Refreshing overview https://lwn.net/Articles/452418/ https://lwn.net/Articles/452418/ sumanah <div class="FormattedComment"> I feel way more ready to think about data warehousing challenges (and opportunities) now! Thanks, Josh &amp; LWN.<br> </div> Thu, 21 Jul 2011 15:42:40 +0000