<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns="http://www.loc.gov/MARC21/slim">
 <record>
  <leader>03131cam a2200289 a 4500</leader>
  <controlfield tag="001">1/36738</controlfield>
  <controlfield tag="008">090212s2009    ne            001 0 eng  </controlfield>
  <datafield tag="020" ind1=" " ind2=" ">
   <subfield code="a">9780123735775</subfield>
  </datafield>
  <datafield tag="020" ind1=" " ind2=" ">
   <subfield code="a">0123735777</subfield>
  </datafield>
  <datafield tag="035" ind1=" " ind2=" ">
   <subfield code="l">39331</subfield>
  </datafield>
  <datafield tag="040" ind1=" " ind2=" ">
   <subfield code="a">OPELS</subfield>
   <subfield code="b">eng</subfield>
   <subfield code="c">OPELS</subfield>
   <subfield code="d">OCLCQ</subfield>
   <subfield code="d">ZMC</subfield>
   <subfield code="d">OCLCQ</subfield>
   <subfield code="d">OCLCF</subfield>
   <subfield code="d">GR-PeUP</subfield>
  </datafield>
  <datafield tag="100" ind1="1" ind2=" ">
   <subfield code="a">Refaat, Mamdouh.</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
   <subfield code="a">Data preparation for data mining using SAS</subfield>
   <subfield code="h">[electronic resource] /</subfield>
   <subfield code="c">Mamdouh Refaat.</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
   <subfield code="a">Amsterdam ;</subfield>
   <subfield code="a">Boston :</subfield>
   <subfield code="b">Morgan Kaufmann Publishers,</subfield>
   <subfield code="c">c2007.</subfield>
  </datafield>
  <datafield tag="300" ind1=" " ind2=" ">
   <subfield code="a">1 online resource (xxi, 399 p.) :</subfield>
   <subfield code="b">ill.</subfield>
  </datafield>
  <datafield tag="490" ind1="1" ind2=" ">
   <subfield code="a">The Morgan Kaufmann series in data management systems</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
   <subfield code="a">Are you a data mining analyst, who spends up to 80% of your time assuring data quality, then preparing that data for developing and deploying predictive models? And do you find lots of literature on data mining theory and concepts, but when it comes to practical advice on developing good mining views find little how to information? And are you, like most analysts, preparing the data in SAS? This book is intended to fill this gap as your source of practical recipes. It introduces a framework for the process of data preparation for data mining, and presents the detailed implementation of each step in SAS. In addition, business applications of data mining modeling require you to deal with a large number of variables, typically hundreds if not thousands. Therefore, the book devotes several chapters to the methods of data transformation and variable selection. FEATURES * A complete framework for the data preparation process, including implementation details for each step. * The complete SAS implementation code, which is readily usable by professional analysts and data miners. * A unique and comprehensive approach for the treatment of missing values, optimal binning, and cardinality reduction. * Assumes minimal proficiency in SAS and includes a quick-start chapter on writing SAS macros. * CD includes dozens of SAS macros plus the sample data and the program for the book's case study.</subfield>
  </datafield>
  <datafield tag="505" ind1="0" ind2="0">
   <subfield code="t">Introduction --</subfield>
   <subfield code="t">Tasks and Data Flow --</subfield>
   <subfield code="t">Review of Data Mining Modeling Techniques --</subfield>
   <subfield code="t">SAS Macros: A Quick Start --</subfield>
   <subfield code="t">Data Acquisition and Integration --</subfield>
   <subfield code="t">Integrity Checks --</subfield>
   <subfield code="t">Sampling and Partitioning --</subfield>
   <subfield code="t">Data Transformations --</subfield>
   <subfield code="t">Binning and Reduction of Cardinality --</subfield>
   <subfield code="t">Treatment of Missing Values --</subfield>
   <subfield code="t">Predictive Power and Variable Reduction I --</subfield>
   <subfield code="t">Analysis of Nominal and Ordinal Variables --</subfield>
   <subfield code="t">Analysis of Continuous Variables --</subfield>
   <subfield code="t">Principal Component Analysis (PCA) 2 --</subfield>
   <subfield code="t">Factor Analysis --</subfield>
   <subfield code="t">Predictive Power and Variable Reduction II --</subfield>
   <subfield code="t">Putting it All Together --</subfield>
   <subfield code="t">A Listing of SAS Macros.</subfield>
  </datafield>
  <datafield tag="504" ind1=" " ind2=" ">
   <subfield code="a">Includes bibliographical references (p. 373-374) and index.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="4">
   <subfield code="a">Data mining.</subfield>
  </datafield>
  <datafield tag="630" ind1="0" ind2="0">
   <subfield code="a">SAS (Computer file)</subfield>
  </datafield>
  <datafield tag="655" ind1=" " ind2="4">
   <subfield code="a">Electronic books.</subfield>
  </datafield>
  <datafield tag="830" ind1=" " ind2="0">
   <subfield code="a">Morgan Kaufmann series in data management systems.</subfield>
  </datafield>
  <datafield tag="852" ind1=" " ind2=" ">
   <subfield code="a">INST</subfield>
   <subfield code="b">UNIPILB</subfield>
   <subfield code="c">EBOOKS</subfield>
   <subfield code="e">20100617</subfield>
   <subfield code="p">00b39331</subfield>
   <subfield code="q">00b39331</subfield>
   <subfield code="t">ONLINE</subfield>
   <subfield code="y">0</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2="0">
   <subfield code="3">ScienceDirect</subfield>
   <subfield code="u">http://www.sciencedirect.com/science/book/9780123735775</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
   <subfield code="d">/webopac/covers/02/39331_9780123735775.jpg</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
   <subfield code="d">/webopac/covers/02/39331_0123735777.jpg</subfield>
  </datafield>
 </record>
</collection>
