- Pentaho Community Contributions (#KCM19)
  Leonardo Coelho, Software Engineer at Hitachi Vantara, and I presented at the Kettle Community Meeting 2019 (#KCM19): How can a Community Member contribute to Pentaho? We both think it is helpful for the community to create a blog post and list the noteworthy links that we provided. It might also help to get found by *any* … Continue reading
- Pentaho Community Meeting PCM18 is coming to Bologna
  In short: The 11th Pentaho Community Event will be in beautiful Bologna on November 23 through 25, 2018. More details over here: PCM18. First of all, it is awesome to see how the Italia Pentaho User Group, together with IT-Novum, is organizing this event. These guys are doing a fantastic job, thanks to all … Continue reading
- Pentaho Community Meeting PCM17 – Call for Collectibles
  Just in case you don’t know yet: Pentaho Community Meeting PCM17 celebrates its 10th anniversary this November (10th through 13th) with a three-day event full of presentations and networking. And it-novum has the honor of hosting the event together with Pentaho. Read more over here: PCM17 and FAQ. Well, I’m sure you already know this … Continue reading
- Metadata Injection – Examples for Special Scenarios
  Since Metadata Injection (MDI) with Pentaho Data Integration is becoming more and more popular and is used in a lot of projects, this blog post provides examples that help in special scenarios. It is a follow-up to my previous blog post about Metadata Injection, which provides some more basics and background. The feature has … Continue reading
- How to visualize PDI Data Lineage with yEd
  A description of the PDI Data Lineage feature, including setup instructions, can be found within the Pentaho 6.1 Documentation and in Pedro Alves’ blog post Seeing PDI lineage information. This post is a step-by-step example showing how to use PDI Data Lineage with yEd. Setup PDI Data Lineage: Modify …\system\karaf\etc\pentaho.metaverse.cfg (Client & DI-Server when … Continue reading
- Pentaho 6.1 is Released – Part 2 – Metadata Injection
  As I mentioned yesterday, Metadata Injection is my favorite part of this release. Now I get to explain why. Metadata Injection (MDI) gives you the ability to modify transformations at execution time. By dynamically passing source metadata to PDI at run time, IT teams can drive hundreds of data ingestion and preparation processes through just … Continue reading
- Pentaho 6.1 is Released – Part 1 – New Features and Improvements
  I am very excited to share that today we announced Pentaho 6.1 (see also the German press release). There is so much to share that I am dividing this into two blog posts. Today I will share a summary of the new features and improvements of Pentaho 6.1. Stay tuned for tomorrow when I will … Continue reading
- Some Historic Cornerstones of Kettle & Pentaho
  Every time something new appears on the horizon, it’s a good time to look back. The intention of Hitachi Data Systems to acquire Pentaho is such a remarkable move that it opens up a great new chapter in Pentaho’s history. As I just researched the details of the journey, I was very … Continue reading
- PDI/Kettle Solution Share (Presented at #PCMD15)
  This was presented at the German Pentaho Community Meeting 2015 (#PCMD15): Before you share a PDI solution with the community, clients, etc., you probably want to ensure that it does not contain any security-related information you do not want to share. There is a lot of information within transformations and jobs that you may not … Continue reading
- PDI/Kettle Telemetry (Presented at #PCM14)
  This was presented at the Pentaho Community Meeting 2014 (#PCM14): There are many reasons to collect usage statistics, for example: It can help improve the product in the most-used areas and features (steps, job entries, database types etc.). It can help the user determine whether some features are affected by a … Continue reading
- Pentaho 5.2 Released at PentahoWorld 2014
  A lot of exciting things happened at PentahoWorld 2014; you can read a lot about it on Twitter under #PWorld2014. It is amazing what momentum we reached and that over 400 attendees came to this conference. BTW: Did you know that Pentaho celebrated its 10th anniversary last week, too? For me, as a “PDI/Kettle addicted … Continue reading
- PentahoWorld 2014 – The 1st Annual Worldwide Conference is coming soon
  I’m very happy to see PentahoWorld coming soon, on October 8 through 10, 2014. It is our first(!) worldwide conference for Users, Advocates and Partners of Pentaho. After many years of Pentaho community meetings, this is the next logical step in boosting the great Pentaho story. I initiated and organized the first Pentaho community meeting. … Continue reading
- Pentaho 5.1 EE & CE is released – What’s New in PDI & Big Data?
  Pentaho 5.1 delivers code-free analytics directly on MongoDB, simplifies data preparation for data scientists and adds full YARN support. See the full list of Pentaho Business Analytics 5.1 features and technical specifications. Here are some more details about the new PDI 5.1 release: Native (YARN) Hadoop integration: PDI includes support for YARN capabilities including enhanced … Continue reading
- Pentaho 5.0 CE is released – What’s New in PDI & Big Data?
  In addition to the release of the Enterprise Edition (EE), Pentaho released the stable build of 5.0 Community Edition (CE). If you can’t wait to get to the download, have a look at our new community.pentaho.com. It hosts all information and the download link. And for the download of a free 30-day trial of Pentaho EE, … Continue reading
- What’s New in PDI 4.4?
  Among the other great additions like Mobile and many other feature enhancements in the Pentaho Business Analytics Suite 4.8, here are the highlights of the new PDI 4.4 release: Pentaho Instaview: Pentaho Instaview is the fastest way to start using Pentaho Data Integration to analyze and visualize data. Instaview uses templates to manage the complexities … Continue reading
- Call Kettle from PostgreSQL – Part 1
  Some questions and use cases that led me to investigate this area: Can Kettle be called directly from a database trigger? So when rows get inserted, deleted or changed, I want to call a Kettle transformation. Can we call a function in a SQL statement that returns the fraud probability calculated by a WEKA data … Continue reading
- Carte as a Windows Service
  Carte is a simple web server that allows you to execute transformations and jobs remotely and to run transformations clustered. Carte is normally started with a .bat file within Windows environments, but there are some use cases for running Carte as a Windows Service, e.g.: When Carte instances are running in a command window, anyone by … Continue reading
- Kettle and NoSQL: MongoDB
  On May 23rd, 2012, Pentaho and 10gen are jointly announcing a partnership to provide direct integration between Pentaho Business Analytics and MongoDB. MongoDB is a scalable, high-performance, open source NoSQL database featuring document-oriented storage, auto-sharding for horizontal scalability, rich document-based queries and fast in-place updates. MongoDB is designed with both scalability and developer agility in … Continue reading
- How to deal with Kettle bugs?
  I’m always amazed by our great community: finding bugs and even fixing them! That’s the way it goes: Thanks to Paul for making my day with this post. If you want to learn more about bug reports, have a look at: Bug Reports and Feature Requests FAQ
- Data Profiling and Data Quality (Human Inference) Integration with Kettle
  Data Profiling with DataCleaner (Human Inference) and Kettle: It was already possible to profile your data in an easy way with Kettle: Open the Database Explorer, choose a table and click Data Profile in the right-click context menu. The result was basic information about the data like Min, Max, Count all for strings and … Continue reading
- Pentaho Kettle for Big Data
  All of Pentaho’s big data capabilities will be available as open source in the new Pentaho Kettle 4.3 release. Big data capabilities include the ability to input, output, manipulate and report on data using the following Hadoop and NoSQL stores: Cassandra, Hadoop HDFS, Hadoop MapReduce, Hadapt, HBase, Hive, HPCC Systems and MongoDB. With regard to … Continue reading
- Things to avoid with PDI (Part 1 to 5)
  Just watch…
- ExeBatLauncher: Calling .bat Files by .exe Executables (e.g. Having a Spoon.exe)
  The ExeBatLauncher is a simple way of calling .bat files as .exe files by creating an .exe command that calls the .bat command of the same name. For example, if you want to call the Spoon.bat file via Spoon.exe, simply rename the ExeBatLauncher.exe (found in the distrib folder) to Spoon.exe and copy it to the same directory … Continue reading
- First German Pentaho Customer Meeting in Munich
  Today was our first German Customer Meeting, initiated by Bruno; it took place at one of our customer locations, Wirecard AG (many thanks!). With a nice (and tightly packed) agenda, around 20 attendees discussed the presented customer solutions, their experiences and what Pentaho can do better, but also what really works well, and this is … Continue reading
- The new XML Input Stream (StAX) step in PDI 4.2
  This step provides the ability to read data from any type of XML file using the StAX parser. The existing Get Data from XML step is easier to use but uses DOM parsers that need in-memory processing, and even purging parts of the file is not sufficient when these parts are very … Continue reading
- Pentaho BI 4 Delivers Power to the User
  New interactive reporting and enhanced visualizations enable fast and affordable user-driven BI. More details can be found over here: http://www.pentaho.com/power-to-the-user Watch the video!
- Security Considerations and Encryption with Kettle
  Kettle is used more and more in enterprises where the standard obfuscation of credentials is not sufficient. There are requirements to use strong encryption methods and even to store internal data encrypted (covered in PDI-6168 and PDI-6170). The above use cases inspired me to create some simple transformations to test and play around with … Continue reading
- iPhone Tracking: How to read the consolidated.db with Kettle
  I will not discuss the buzz about the iPhone tracking – all that needs to be discussed is already out. That the iPhone stores locations in the consolidated.db has been known for a long time, but now we have a proof of concept by Pete Warden and Alasdair Allan, and this inspired me to dig … Continue reading
- How the SQL CASE WHEN construct can help you in pivoting data?
  Within Kettle you have the Normalizer and Denormalizer steps to help with pivoting (a simplified use case for pivoting is changing the row/column axis, see also transpose). If you would like to have a cross table, you may use a Kettle transformation to accomplish this. (These days Pentaho Reporting’s cross tab functionality is on … Continue reading
- Connecting with SAP BI Systems
  There are a lot of different options to connect with an SAP system from Kettle. Back in 2009 I tested the functionality to access an SAP BI system via web services (XML/A), and here are my (historic) findings: Analyzing the result of the MDX query with Kettle is complicated (you need for … Continue reading
- Operational Patterns and the Watchdog Concept for Kettle
  The Watchdog Concept for Kettle was presented at the Pentaho Community Event in Cascais, Portugal in September 2010. It came to my mind when I created the operational patterns for the Pentaho Data Integration for Database Developers training course and combined this with solutions from electronics to solve these types of problems (e.g. detect software … Continue reading
- Another blog about Kettle aka Pentaho Data Integration (PDI)
  This is the starting point for the blog Fun Stuff about the Open Source ETL Tool Kettle aka Pentaho Data Integration (PDI). The short history: Five years ago, in early December 2005, Matt Casters released the initial open source version of Kettle. I already knew Kettle before that time and had developed a plug-in to get data … Continue reading