Log Buffer #11: A Carnival of the Vanities for DBAs

September 22nd, 2006 by Mark Rittman

Welcome to this week’s Log Buffer, where we’ll take a look at some of
the hot topics and blog postings over the last few days.

As it’s Friday and we’re all starting to wind down for the weekend, we’ll start
off with Tom Kyte on The Daily WTF who has come
up with an example of the ultimate
extensible database
– as long as your users never want to report off of it.
On a similar topic, Jeff
Smith’s SQL Server Blog
goes through his "Top
10 Things I Hate About SQL Server"
, with #2 being "Datatype
conversions …What is the deal with data types?  I just use VARCHAR for
everything, since that’s the only way to avoid stupid error messages when I add
data to my tables. Works great,
until I want to get the month of a date or add two numbers — stupid SQL Server
makes me convert() the value first. How
dumb is that? SQL doesn’t
know how to add "12" and "5" — it thinks it’s
"125" !!!! To make things
even worse, SQL Server gives me errors half the time saying it CAN’T convert! Uh, you are a computer; you should be able to figure this stuff out."

Jonathan Lewis warns
us about downloading and running scripts you find on the
internet
, whilst Jeff Moss writes about helping the Oracle CBO by adding
constraints
, and then realises that it’s actually
all in Jonathan’s CBO book
. Jeff’s presented a couple of good
papers at the UKOUG on data
warehouse tuning
and partitioning
and compression
which are well worth reading if you’ve got a
spare half an hour.

Jeff Hunter finds the idea of managers
actually judging database costs on annual TCA per user amusing
,
thinking it’s better to perform real-world tests using real
applications rather than rely on artificial benchmarks, whilst Pete
Finnigan celebrates two
years of Bogging
(is that anything like dogging, Pete?). 

Guy Bowerman from Informix talks about Finite
Field arithmetic and Communitive Division Rings
, and
subsequently risks a breach in the space-time continuum, whilst Curt
Monash
asks whether data
warehousing is all now about sequential access
(perhaps we
should ask Fabian?).

Daniel Schneller presents a preview of the
new
mySQL Index Analyzer
, whilst Eddie Awads comes up with a
way of making NULL=NULL
(and does cause a breach in the
space-time continuum).

“Does anyone
still do logical data models”
asks Ronald (depends
if your client is paying fixed price or time and materials, in my experience…) and thinks about the
difference between hobbyists and professionals
. Duncan Lamb
points to an article on Oracle
Rootkits 2.0
(available shortly with Oracle 11g?) whilst the Oracle WTF guys post instructions on how
to capture spiders humanely using a one of Tom Kyte’s books
-
the net surely closes on Mr. Burns; how long can this man stay free?

David Wheeler talks about Batch
Updates with PL/pgSQL
(zzzzzzzzzzzz….) whilst
(Groundskeeper?) Willie talks about the secrets of a successful upgrade
to DB2 for z/OS Version 8. Time for some data warehousing articles I
think…

Andy Hayler, one of the founders of Kalido and a writer on data
warehousing and enterprise software, takes a look at data
warehousing architectures over time
and concludes that whilst
separate data marts and techniques such as EII have their place, if you
want proper reporting against multiple transaction systems you really
need a proper data warehouse, and an approach to deal with the issue of
master
data management
. Although Andy typically pushes the idea of
packaged data warehousing, he makes some good observations in his blog
about the enterprise software market and in particular the efforts of
vendors such as Oracle and Microsoft to eat into the market previously
held by the specialist vendors. Check out “Data
Warehousing Dead”
by William McKnight and “Logical
Data Modelling: A Key to a Successful Data Warehouse Implementation”

whilst you’re there.

Lucas Jellema from AMIS comes up with some questions and approaches for
tuning
SQL statements using execution plans
, Neshan Bardolliwalla
talks about SAP’s
new in-memory BI accelerator
that requires no tuning or need
to build aggregates (as my son says, "yeah…right."), whilst David Walker’s Data Management
Site
has published and overview
architecture for Enterprise Data Warehouses
.

Gabriel Fuchs talks about Business
Intelligence & Different Cultures
, and concludes that
in a BI&W project, cultural considerations may equal technical
ones in terms of importance. Finishing off on data warehousing, I’ve
just had an article published on Oracle Technology Network on Data
Profiling and Automatic Cleansing using Oracle Warehouse Builder 10gR2
,
and I’ve also come up with some interesting issues around Warehouse
Builder’s handling of slowly changing dimensions
that you
might find useful – if you can make it through to the end of what’s a fairly
lengthy investigation.

That’s it for this week, thanks to the team over at Pythian for inviting me to do this Log Buffer, and have a great weekend.

Comments

  1. Ronald Bradford Says:

    What I haven’t seen in the recent blog spots this week is any news on the Oracle release of Oracle® Berkeley DB Java Edition 3.0. Quoting from the Oracle Press Release
    17-MAY-2006 At the JavaOne Developer Conference today, Oracle announced the general availability of Oracle® Berkeley DB Java Edition Release 3.0. This is the third major release in three years of the high-performance, pure Java embeddable database and demonstrates both Oracle’s commitment to continued development of the Oracle Berkeley DB product family and to the open source community.
    PS: Your link to “the difference between hobbyists and professionals” should be http://blog.arabx.com.au/?p=345

  2. Nenshad Bardoliwalla Says:

    Hi Mark,
    I respect you as one of the leading minds as in the Oracle Analytics and Business Intelligence space. I have read many of your posts on Oracle Business Intelligence Enterprise Edition and you are spot on. I should know. I was responsible for Siebel Analytics Server as a product manager and prior to this was in Siebel’s Expert Services group as the lead expert consultant at the company on Siebel Analytics.
    I was flattered to see that you mention above that “Neshan Bardolliwalla talks about SAP’s new in-memory BI accelerator that requires no tuning or need to build aggregates (as my son says, “yeah…right.”)” While I am not at liberty to disclose the detailed internals of the product, consider that a 64-bit computer can address 16 exibytes of memory.
    So, if you were take your ENTIRE data warehouse at its MOST granular level, without ANY aggregates whatsoever, put it in main memory, index that data, and then use clever on-the-fly search techniques to locate that data, you could most certainly eliminate the need for aggregates or any tuning whatsoever. I’ve seen it with my own eyes, and yes, I didn’t believe it when I heard it either.
    With today’s technology and Moore’s law continuing to operate, the need for disk-based database systems will continue to become less relevant as memory-based systems become cheaper to exploit and algorithms continue to be developed that allow one to search the addressable space of memory that a 64 bit processor is capable of addressing.
    Best Regards,
    Nenshad

  3. mark Says:

    Hi Nenshad
    Good to hear from you – if you take a look at my posting a day later, you’ll see that I actually took back what I said about the technology, I agree it does look very impressive.
    http://www.rittman.net/2006/09/25/saps-new-bi-accelerator/
    Unfortunately it’s unlikely I’ll get to look at in more detail – because I mainly work with Oracle technology – but I agree, it does sound very good. I’ll keep abreast of your blog though, and try and keep up with where the technology goes.
    best regards
    Mark

Website Design & Build: tymedia.co.uk