Data Mining Group PMML FAQ

Frequently Asked Questions about PMML

Q. What is Predictive Model Markup Language (PMML)?

Predictive Model Markup Language (PMML) is an XML mark up language to describe statistical and data mining models.

Q. How would you use Predictive Model Markup Language (PMML) to describe a predictive model?

PMML describes the inputs to data mining models, the transformations used prior to prepare data for data mining, and the parameters which define the models themselves.

Q. How is PMML used?

A. PMML is used for a wide variety of applications, including applications in finance, e-business, direct marketing, manufacturing, and defense.

Q. Who has released PMML products?

A. PMML is used in released products by many vendors, including both those active in the Data Mining Group who are developing the standard, as well as others who just desire an XML interchange format for statistical and data mining models. It is the most widely deployed data mining standard.

Q. How is PMML related to other data mining standards?

PMML is complementary to many other data mining standards. It's XML interchange formats is supported by several other standards, such as XML for Analysis, JSR 73, and SQL/MM Part 6: Data Mining.

Q. What is the official location of schema that can be used to verify a model?

Most recently released documentation package available on Sourceforge is the official location of schema.

Q. How is Version 3.0 different from earlier versions?

Version 3.0 has improvements and additions relating to Association Rules, Builtin Functions, Clustering Model and Data Dictionary.

Q. How is Version 2.1 different than earlier versions?

Version 2.1 supports a richer set of transformations. It also uses XML schemas instead of DTDs. In addition, a number of minor changes were made for consistency.

Q. How can I help?

One way to help is to contribute to a library of PMML examples that we are assembling. Please send these via email to the Public Forum discussion list on the PMML Project Page at Source Forge. We will assemble them and put them on the DMG home page.

Q. Why does it take so long for the DMG to respond to email?

We are a volunteer organization without any full time staff. We are actively looking for a volunteer who can help with some administrative support. Please send email to info at dmg.org if you are interested.

Q. Is the Data Mining Group part of any standards group?

A. PMML is part of xml.org.

Q. What is the release history for PMML?

PMML Version 0.7 developed by National Center for Data Mining, July 1997.
PMML Version 0.9 developed, Data Mining Group formed, July, 1998
PMML Version 1.0 released, August, 1999.
PMML Version 1.1 released, August, 2000.
PMML Version 2.0 released, August, 2001.
PMML Version 2.1 released, March, 2003.
PMML Version 3.0 released, October, 2004.
PMML Version 3.1 released, December, 2005.
PMML Version 3.2, released, May 2007.

e-mail info at dmg.org