DMDW – Test exam | Exam time and room

On 24 May 2011 | 13 comments | in DMDW lecture | by Johannes Hoppe

The exam starts today (Friday, 27th of May) at 14:00.

Room: t83
Duration: 180 minutes
All you need is an indelible pencil.



Many of you asked for a test-exam. Here it is!

You can use the following questions to gauge your knowledge for the upcoming DMDW exam.
Please note that the real exam will be longer, since it lasts 3 hours.

Download: Test-Exam DMDW

Update 2011-05-25:

I’m getting several mails and comments about question number 5.
The question concentrates on the SSIS control flow tasks that everybody should already know:

DMDW – ETL Projects – Student Presentations

On 13 May 2011 | 8 comments | in DMDW lecture | by Johannes Hoppe

This is a list of all the interesting projects that my students have made.
Well done! I’m happy with the results! 😀

  1. Team „Pure Access“
  2. Team „Access to Access2MySQL to MySQL“
  3. Team „Silverlight to MS SQL“
  4. Team „PHP to MySQL“
  5. Team „Pentaho Data Integration (Kettle)“
  6. Team „SSIS“
  7. Team „Pentaho Data Integration (Kettle)“
  8. Team „Groovy to MongoDB“
  9. Team „Java to MySQL“
  10. Team „SSIS“
  11. Team „Java to MongoDB“
  12. Team „SSIS“


Please note: I’m still waiting for some mails. Please send me the PowerPoint slides as well as the project files! (see comments below)

DMDW Lesson 08 – Further Data Mining Algorithms

On 10 May 2011 | 1 comment | in DMDW lecture | by Johannes Hoppe

This will be our last lesson, so let’s concentrate on finishing the last required algorithms.

Please collect some questions in advance!
I would like to discuss them in the remaining time.
I will prepare a small test exam, too.

DMDW Extra Lesson – NoSQL and MongoDB

On 06 May 2011 | 2 comments | in DMDW lecture, NoSQL | by Johannes Hoppe

This Friday I introduced you (== my students) to the wonderful new world of databases that are “not only SQL” (NoSQL). Later on we played with the MongoDB shell and created, read, updated and deleted some documents.
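
If you want to revisit the four operations (create, read, update, delete) without a running server, here is a minimal sketch in plain Python. It does not talk to MongoDB at all: a list of dictionaries stands in for a collection of JSON-style documents, and every name in it (students, the field names, the helper functions) is made up for illustration. In the mongo shell the corresponding collection methods are, for example, insert, find, update and remove.

```python
# In-memory stand-in for a MongoDB collection: a list of JSON-style documents.
# All names (students, field names, helper functions) are hypothetical and
# only illustrate the four operations; this is not the MongoDB API.
import copy
import itertools

_next_id = itertools.count(1)
students = []  # plays the role of a collection


def matches(doc, query):
    """True if every key/value pair of the query is present in the document."""
    return all(doc.get(key) == value for key, value in query.items())


def create(collection, doc):
    """Create: store a copy of the document and assign an _id if missing."""
    stored = copy.deepcopy(doc)
    stored.setdefault("_id", next(_next_id))
    collection.append(stored)
    return stored["_id"]


def read(collection, query):
    """Read: return all documents matching the query."""
    return [doc for doc in collection if matches(doc, query)]


def update(collection, query, changes):
    """Update: set the given fields on every matching document."""
    hits = read(collection, query)
    for doc in hits:
        doc.update(changes)
    return len(hits)


def delete(collection, query):
    """Delete: remove every matching document."""
    before = len(collection)
    collection[:] = [doc for doc in collection if not matches(doc, query)]
    return before - len(collection)


create(students, {"name": "Alice", "course": "DMDW", "tags": ["etl"]})
create(students, {"name": "Bob", "course": "RIA"})
update(students, {"name": "Alice"}, {"course": "NoSQL"})
delete(students, {"name": "Bob"})
print(read(students, {"course": "NoSQL"}))
```

Note that the documents need no common schema: one has a tags field and the other does not, which is the main difference to a row in a relational table.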

For the MongoDB workshop we used some training data that you can find here. Please don’t forget to solve the exercises! If you have further questions, contact me via blog comment or mail.


DMDW – Presentation of your ETL project

On 03 May 2011 | 2 comments | in DMDW lecture | by Johannes Hoppe

Hello students,

As you know, every student has to present their ETL project.

The task:

  1. Please Extract the content of the room-plan Excel file
    (write me a mail if you don’t know the password)
  2. then Transform it to a new structure.
    (e.g. normalize the content, clean it, deal with typos, parse the dates…)
  3. and finally Load it to a new target.
    (In most cases this target will be a database system.)
  4. Present your solution to the other students and give me your materials. Don’t forget your name on the slides.
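
To make the three steps concrete, here is a rough sketch in Python using only the standard library. It is not a reference solution. Everything specific in it is an assumption: the real room plan is an Excel file whose layout is not shown here, so a small semicolon-separated text with invented columns (room, lecture, date) stands in for it, and an in-memory SQLite database stands in for the target.

```python
import csv
import io
import sqlite3
from datetime import datetime

# Hypothetical stand-in for the room-plan Excel file, exported as CSV.
# The column names and values are assumptions, not the real file's content.
RAW = """room;lecture;date
 T83 ;DMDW;27.05.2011
t83;DMDW ;2011-05-13
T 83;RIA;15.04.2011
"""

DATE_FORMATS = ("%d.%m.%Y", "%Y-%m-%d")  # formats we assume may occur


def parse_date(text):
    """Try each known format and return an ISO date string."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognised date: {text!r}")


def extract(raw):
    """Extract: read the rows as dictionaries."""
    return list(csv.DictReader(io.StringIO(raw), delimiter=";"))


def transform(rows):
    """Transform: trim whitespace, unify room spelling, parse dates."""
    cleaned = []
    for row in rows:
        cleaned.append({
            "room": row["room"].replace(" ", "").lower(),
            "lecture": row["lecture"].strip(),
            "date": parse_date(row["date"]),
        })
    return cleaned


def load(rows, conn):
    """Load: write normalised rooms and bookings into the target database."""
    conn.execute("CREATE TABLE room (id INTEGER PRIMARY KEY, name TEXT UNIQUE)")
    conn.execute(
        "CREATE TABLE booking (room_id INTEGER REFERENCES room(id), "
        "lecture TEXT, day TEXT)"
    )
    for row in rows:
        # Each room is stored once; bookings point to it by id.
        conn.execute("INSERT OR IGNORE INTO room (name) VALUES (?)", (row["room"],))
        (room_id,) = conn.execute(
            "SELECT id FROM room WHERE name = ?", (row["room"],)
        ).fetchone()
        conn.execute(
            "INSERT INTO booking VALUES (?, ?, ?)",
            (room_id, row["lecture"], row["date"]),
        )
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW)), conn)
print(conn.execute("SELECT COUNT(*) FROM room").fetchone()[0])
```

Keeping extract, transform and load as separate functions makes it easy to swap one step, for example reading the real Excel file or writing to MySQL or MongoDB instead of SQLite.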

MongoDB – Training Data

My favourite upcoming topic will be a short excursion to MongoDB. MongoDB (from “humongous”) is a scalable, high-performance, open source, document-oriented database. It uses no SQL, stores JSON-style documents and is fast as hell! 👿

Please install MongoDB as described here: Quickstart Windows
And please download these files with sample data:

These MongoDB Quick Reference Cards can be very helpful:

Here are the slides for the NoSQL presentation and the MongoDB workshop!

RIA Presentations 2 – Time and Room

On 13 Apr 2011 | No comments | in RIA lecture | by Johannes Hoppe

This Friday, the 15th of April (2011-04-15), I will be happy to see your presentations.
Let’s meet at 10 AM in the foyer to look for a suitable room!

I like presentations that are well prepared and where all team members are presenting. There is no fixed format or time box!
It’s up to you how you want to convince me that you’ve created an amazing project. But please keep in mind that presentations that are too long or too boring won’t be very convincing! 😉

DMDW Lesson 05 + 06 + 07 – Data Mining Applied

On 01 Apr 2011 | 1 comment | in DMDW lecture | by Johannes Hoppe

In this series of lectures (05. + 06. + 07.) we will concentrate on applied data mining.


  • Lesson 05. – 01.04.2011
  • Lesson 06. – 08.04.2011
  • Lesson 07. – 15.04.2011

The current agenda

  1. Applications of Data Mining
  2. Data Mining Algorithms
  3. Review – Data Types, Content Types
  4. Data Mining Algorithms – Decision Trees
  5. Data Mining Algorithms – Clustering

I will probably change the agenda and slides over the coming days. Please make sure that you have the latest version.



DMDW Materials – Last update: 2011-04-01


I created an SVN repository for the source code used in the lecture:
(svn checkout lecture-hoppe-read-only)



I would like to do a practical session with the
“SQL Server Business Intelligence Development Studio”.
Our source will be an old Excel file with room plans. You can download it here:

To ensure a little bit of privacy, the zip file is password protected. Ask me face-to-face for the password!



I expect you to have the following tools
and resources on your laptop:

  • Microsoft Visual Studio 2008 (not 2010, not Express!)

DMDW Lesson 04 – Data Mining Theory

On 23 Mar 2011 | No comments | in DMDW lecture | by Johannes Hoppe

In this lecture we will concentrate on the theory behind data mining.
We will also revisit an algorithm we have already seen: the decision tree!

Please note that this topic continues in the next lesson, No. 5.


Here are the slides:
