SAN JOSE, Calif., March 14, 2017 -- (Strata + Hadoop World, Booth #1321) - Pentaho, a Hitachi Group Company, today announced orchestration capabilities that streamline the entire machine learning workflow and enable teams of data scientists, engineers and analysts to train, tune, test and deploy predictive models. Pentaho’s Data Integration and analytics platform ends the ‘gridlock’ associated with machine learning by enabling smooth team collaboration, maximizing limited data science resources and putting predictive models to work on big data faster -- regardless of use case, industry, or language -- whether models were built in R, Python, Scala or Weka.
Streamlining four areas of the machine learning workflow
With Pentaho’s machine learning orchestration, the process of building and deploying advanced analytics models maximizes efficiency. Most enterprises struggle to put predictive models to work because data professionals often operate in silos and the workflow -- from data preparation to updating models -- create bottlenecks. Pentaho’s platform enables collaboration and removes bottlenecks in four key areas:
- Data and feature engineering -- Pentaho helps data scientists and engineers easily prepare and blend traditional sources like ERP, EAM and big data sources like sensors and social media. Pentaho also accelerates the notoriously difficult and costly task of feature engineering by automating data onboarding, data transformation and data validation in an easy-to-use drag and drop environment.
- Model training, tuning and testing -- Data scientists often apply trial and error to strike the right balance of complexity, performance and accuracy in their models. With integrations for languages like R and Python, and for machine learning packages like Spark MLlib and Weka, Pentaho allows data scientists to seamlessly train, tune, build and test models faster.
- Model deployment and operationalization -- a completely trained, tuned and tested machine learning model still needs to be deployed. Pentaho allows data professionals to easily embed models developed by the data scientist directly in a data workflow. They can leverage existing data and feature engineering efforts, significantly reducing time-to-deployment. With embeddable APIs, organizations can also include the full power of Pentaho within existing applications.
- Update models regularly -- According to Ventana Research, less than a third (31%) of organizations use an automated process to update their models. With Pentaho, data engineers and scientists can re-train existing models with new data sets or make feature updates using custom execution steps for R, Python, Spark MLlib and Weka. Pre-built workflows can automatically update models and archive existing ones.
Hitachi Rail uses Pentaho with Hitachi’s Hyper Scale-Out Platform to fulfil its pioneering "Trains-as-a-Service" concept, applying advanced IoT technology in three event horizons: real-time (monitoring, fault alerting), medium-term (predictive maintenance) and long-term (big data trend analysis). With each train carrying thousands of sensors generating huge amounts of data per day, the project’s data engineers and scientists face many challenges associated with big data and machine learning. Although the project is not yet operational, Pentaho is already helping to deliver productivity improvements across the business.
According to Philip Hewlett, Project Manager, “Hitachi Rail conservatively estimate that Pentaho’s orchestration capabilities for data preparation, engineering and machine learning have already delivered wide-reaching productivity improvements and specialized development ambitions which will translate into value-added services for our customers -- and this is at a very early stage in the project.”
David Menninger, SVP & Research Director, Ventana Research, commented, “According to our research, 92 percent of organizations plan to deploy more predictive analytics, however, 50 percent of organizations have difficulty integrating predictive analytics into their information architecture. Pentaho offers a robust platform to help companies take advantage of machine learning algorithms throughout their organization, helping business units and IT to work together with the common goal of making predictive analytics deliver value to the enterprise.”
Wael Elrifai, Director of Worldwide Enterprise Data Science, Pentaho said, “In 2017 we’re working with early adopters looking to transform their businesses with machine learning. Fortunately our early foray into big data analytics gave us the insight to solve some of the toughest challenges in this area. As part of Hitachi, which has a large team of data science experts, we will continue growing our machine learning capabilities as this market matures.”
The machine learning orchestration capabilities are available in Pentaho 7.0.
Resources
- Learn more about Pentaho’s machine learning orchestration capabilities
- Visit us at booth #1321 at Strata+Hadoop World San Jose the week of March 13th
- Join Pentaho for our Strata session “Five steps to a killer data lake, from ingest to machine learning” on March 15 at 1:50 pm.
ABOUT PENTAHO, A HITACHI GROUP COMPANY
Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho’s mission is to help organizations across multiple industries harness the value from all their data, including big data and IoT, enabling them to find new revenue streams, operate more efficiently, deliver outstanding service and minimize risk. Pentaho has over 15,000 product deployments and 1,500 commercial customers today including ABN-AMRO Clearing, BT, Caterpillar Marine Asset Intelligence, EMC, Moody's, NASDAQ and Sears Holdings Corporation. For more information visit www.pentaho.com.
Pentaho Media Contact US & Worldwide Rebecca Shomair Director of Corporate Communications [email protected] +1 646-484-8150


Microsoft Commits $18 Billion to Expand AI and Cloud Infrastructure in Australia
Tesla Q1 Earnings Preview: Robotaxi Delays and SpaceX Merger Speculation Grow
European Car Sales Surge in March as EV and Hybrid Demand Accelerates
Indonesia and Toyota Explore $300M Bioethanol Investment to Boost Renewable Energy Goals
LG Innotek Stock Hits Record High on $68M Automotive Wi-Fi 7 Deal
China Food Delivery Stocks Dip as Regulators Crack Down on “Ghost Deliveries”
OPmobility Reports Q1 Revenue Dip Amid Automotive Industry Slowdown
Elon Musk Faces French Probe Over X and Grok Amid Rising U.S.-EU Tensions
Kakaku.com Stock Surges on EQT Takeover Interest Amid Rising Japan Deal Activity
SK Hynix Reports Record Q1 Profit Surge Driven by AI Memory Chip Demand
Amazon Expands AI Bet with Up to $25 Billion Investment in Anthropic
J.P. Morgan Downgrades Essity AB on Rising Costs and Weak Earnings Outlook
Florida Investigates OpenAI and ChatGPT Over Alleged Role in FSU Shooting
Jeff Bezos Eyes $10 Billion Funding Round for AI Venture Project Prometheus
John Ternus Signals Apple’s Future with Product-First AI Strategy
Chinese Robotics Stocks React as Humanoid Robot Marathon Sparks Competition Concerns
Rising Jet Fuel Costs from Iran Conflict Push Airfare Higher Across Europe 



