<?xml version="1.0" encoding="UTF-8"?>
<mods xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" version="3.1" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
  <titleInfo>
    <title>Pro Hadoop data analytics</title>
    <subTitle>designing and building big data systems using the Hadoop ecosystem</subTitle>
  </titleInfo>
  <name type="personal">
    <namePart>Koitzsch, Kerry</namePart>
    <role>
      <roleTerm authority="marcrelator" type="text">creator</roleTerm>
    </role>
  </name>
  <typeOfResource>text</typeOfResource>
  <genre authority="marc">bibliography</genre>
  <originInfo>
    <place>
      <placeTerm type="code" authority="marccountry">cau</placeTerm>
    </place>
    <dateIssued encoding="marc">2017</dateIssued>
    <issuance>monographic</issuance>
  </originInfo>
  <language>
    <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
  </language>
  <physicalDescription>
    <form authority="marcform">print</form>
    <extent>xxi, 298 p.</extent>
  </physicalDescription>
  <abstract>"Learn advanced analytical techniques and leverage existing toolkits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems which go beyond the basics of classification, clustering, and recommendation"</abstract>
  <tableOfContents>Overview: Building data analytic systems with Hadoop -- A Scala and Python refresher -- Standard toolkits for Hadoop and analytics -- Relational, NoSQL, and graph databases -- Data pipelines and how to construct them -- Advanced search techniques with Hadoop, Lucene, and Solr -- An overview of analytical techniques and algorithms -- Rule engines, system control, and system orchestration -- Putting it all together: designing a complete analytical system -- Data visualizers: seeing and interacting with the analysis -- A case study in bioinformatics: analyzing microscope slide data -- A Bayesian analysis component: identifying credit card fraud -- Searching for oil: geographical data analysis with Apache Mahout -- "Image as big data" systems: some case studies -- Building a general purpose data pipeline -- Conclusions and the future of big data analysis -- Setting up the distributed analytics environment -- Getting, installing, and running the example analytics system.</tableOfContents>
  <note type="statement of responsibility">Kerry Koitzsch.</note>
  <note>Includes bibliographical references and index.</note>
  <subject authority="lcsh">
    <titleInfo>
      <title>Apache Hadoop</title>
    </titleInfo>
  </subject>
  <subject authority="fast">
    <titleInfo>
      <title>Apache Hadoop</title>
    </titleInfo>
  </subject>
  <subject authority="lcsh">
    <topic>Database management</topic>
  </subject>
  <subject authority="fast">
    <topic>Database management</topic>
  </subject>
  <classification authority="lcc">QA76.9.D5 K66 2017</classification>
  <relatedItem type="series">
    <titleInfo>
      <title>Books for professionals by professionals</title>
    </titleInfo>
  </relatedItem>
  <identifier type="isbn">9781484219096</identifier>
  <identifier type="lccn">2016963203</identifier>
  <recordInfo>
    <recordContentSource authority="marcorg">YDXCP</recordContentSource>
    <recordCreationDate encoding="marc">161222</recordCreationDate>
    <recordChangeDate encoding="iso8601">20240709135250.0</recordChangeDate>
    <recordIdentifier source="OSt">19421944</recordIdentifier>
    <languageOfCataloging>
      <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
    </languageOfCataloging>
  </recordInfo>
</mods>
