The video below shows a series of use-cases that highlight the use of KNIME for handling data and model building and Zementis ADAPA for model deployment and scoring via PMML, the Predictive Model Markup Language. PMML is the de facto standard to represent data mining models. It allows for predictive models to be easily moved from one application to another since with PMML, there is no need for model re-coding.
The idea here is to show how easy one can easily move a predictive model from the scientist's desktop to the IT operational environment. When training a model, scientists rely on historical data, but when using the model on a regular basis, the model is moved or deployed in production where it presented with new data. ADAPA provides a scalable and lightning fast scoring engine for models that live in production. And, although KNIME data mining nodes are typically used by scientists to build models, its database and REST nodes as well as PMML-enabled nodes can simply be used to create a flow for passing models and data for scoring in ADAPA.
- Read data from a flat file, use KNIME for data pre-processing and building of a neural network model. Export the entire predictive workflow as a PMML file and then take this PMML file and upload and score it in ADAPA via its Admin Web Console.
- Read data from a database (MySQL, SQLServer, Oracle, ...), build model in KNIME, export model as a PMML file and deploy it in ADAPA using its REST API. This use-case also shows new or testing data flowing from the database and into ADAPA for scoring via a sequence of KNIME nodes.
- The video also shows a case in which one can use KNIME nodes to simply read a PMML file produced in any PMML-compliant data mining tool (R, SAS EM, SPSS, ...), upload it in ADAPA using the REST API and score new data from MySQL in ADAPA also through the REST interface. Note that in this case, the model has already been trained and we are just using KNIME to deploy the existing PMML file in ADAPA for scoring.