Hadoop Security

Author: Ben Spivey
Publisher: "O'Reilly Media, Inc."
ISBN: 9781491901342
Release Date: 2015-06-29
Genre: Computers

As more corporations turn to Hadoop to store and process their most valuable data, the risk of a potential breach of those systems increases exponentially. This practical book not only shows Hadoop administrators and security architects how to protect Hadoop data from unauthorized access, it also shows how to limit the ability of an attacker to corrupt or modify data in the event of a security breach. Authors Ben Spivey and Joey Echeverria provide in-depth information about the security features available in Hadoop, and organize them according to common computer security concepts. You’ll also get real-world examples that demonstrate how you can apply these concepts to your use cases. Understand the challenges of securing distributed systems, particularly Hadoop Use best practices for preparing Hadoop cluster hardware as securely as possible Get an overview of the Kerberos network authentication protocol Delve into authorization and accounting principles as they apply to Hadoop Learn how to use mechanisms to protect data in a Hadoop cluster, both in transit and at rest Integrate Hadoop data ingest into enterprise-wide security architecture Ensure that security architecture reaches all the way to end-user access

Big Data Analytics with Hadoop 3

Author: Sridhar Alla
Publisher: Packt Publishing Ltd
ISBN: 9781788624954
Release Date: 2018-05-31
Genre: Computers

Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3 Key Features Learn Hadoop 3 to build effective big data analytics solutions on-premise and on cloud Integrate Hadoop with other big data tools such as R, Python, Apache Spark, and Apache Flink Exploit big data using Hadoop 3 with real-world examples Book Description Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. Big Data Analytics with Hadoop 3 shows you how to do just that, by providing insights into the software as well as its benefits with the help of practical examples. Once you have taken a tour of Hadoop 3’s latest features, you will get an overview of HDFS, MapReduce, and YARN, and how they enable faster, more efficient big data processing. You will then move on to learning how to integrate Hadoop with the open source tools, such as Python and R, to analyze and visualize data and perform statistical computing on big data. As you get acquainted with all this, you will explore how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing. In addition to this, you will understand how to use Hadoop to build analytics solutions on the cloud and an end-to-end pipeline to perform big data analysis using practical use cases. By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform big data analytics and get insight effortlessly. What you will learn Explore the new features of Hadoop 3 along with HDFS, YARN, and MapReduce Get well-versed with the analytical capabilities of Hadoop ecosystem using practical examples Integrate Hadoop with R and Python for more efficient big data processing Learn to use Hadoop with Apache Spark and Apache Flink for real-time data analytics Set up a Hadoop cluster on AWS cloud Perform big data analytics on AWS using Elastic Map Reduce Who this book is for Big Data Analytics with Hadoop 3 is for you if you are looking to build high-performance analytics solutions for your enterprise or business using Hadoop 3’s powerful features, or you’re new to big data analytics. A basic understanding of the Java programming language is required.

Big Data

Author: Viktor Mayer-Schönberger
Publisher: Redline Wirtschaft
ISBN: 9783864144592
Release Date: 2013-10-08
Genre: Political Science

Ob Kaufverhalten, Grippewellen oder welche Farbe am ehesten verrät, ob ein Gebrauchtwagen in einem guten Zustand ist – noch nie gab es eine solche Menge an Daten und noch nie bot sich die Chance, durch Recherche und Kombination in der Daten¬flut blitzschnell Zusammenhänge zu entschlüsseln. Big Data bedeutet nichts weniger als eine Revolution für Gesellschaft, Wirtschaft und Politik. Es wird die Weise, wie wir über Gesundheit, Erziehung, Innovation und vieles mehr denken, völlig umkrempeln. Und Vorhersagen möglich machen, die bisher undenkbar waren. Die Experten Viktor Mayer-Schönberger und Kenneth Cukier beschreiben in ihrem Buch, was Big Data ist, welche Möglichkeiten sich eröffnen, vor welchen Umwälzungen wir alle stehen – und verschweigen auch die dunkle Seite wie das Ausspähen von persönlichen Daten und den drohenden Verlust der Privatsphäre nicht.

Practical Guide to SAP HANA and Big Data Analytics

Author: Dominique Alfermann
Publisher: Espresso Tutorials GmbH
ISBN: 9783960128649
Release Date: 2018-12-20
Genre:

In this book written for SAP BI, big data, and IT architects, the authors expertly provide clear recommendations for building modern analytics architectures running on SAP HANA technologies. Explore integration with big data frameworks and predictive analytics components. Obtain the tools you need to assess possible architecture scenarios and get guidelines for choosing the best option for your organization. Know your options for on-premise, in the cloud, and hybrid solutions. Readers will be guided through SAP BW/4HANA and SAP HANA native data warehouse scenarios, as well as field-tested integration options with big data platforms. Explore migration options and architecture best practices. Consider organizational and procedural changes resulting from the move to a new, up-to-date analytics architecture that supports your data-driven or data-informed organization. By using practical examples, tips, and screenshots, this book explores: - SAP HANA and SAP BW/4HANA architecture concepts - Predictive Analytics and Big Data component integration - Recommendations for a sustainable, future-proof analytics solutions - Organizational impact and change management

Big Data Analytics

Author: Kim H. Pries
Publisher: CRC Press
ISBN: 9781482234527
Release Date: 2015-02-05
Genre: Computers

With this book, managers and decision makers are given the tools to make more informed decisions about big data purchasing initiatives. Big Data Analytics: A Practical Guide for Managers not only supplies descriptions of common tools, but also surveys the various products and vendors that supply the big data market. Comparing and contrasting the different types of analysis commonly conducted with big data, this accessible reference presents clear-cut explanations of the general workings of big data tools. Instead of spending time on HOW to install specific packages, it focuses on the reasons WHY readers would install a given package. The book provides authoritative guidance on a range of tools, including open source and proprietary systems. It details the strengths and weaknesses of incorporating big data analysis into decision-making and explains how to leverage the strengths while mitigating the weaknesses. Describes the benefits of distributed computing in simple terms Includes substantial vendor/tool material, especially for open source decisions Covers prominent software packages, including Hadoop and Oracle Endeca Examines GIS and machine learning applications Considers privacy and surveillance issues The book further explores basic statistical concepts that, when misapplied, can be the source of errors. Time and again, big data is treated as an oracle that discovers results nobody would have imagined. While big data can serve this valuable function, all too often these results are incorrect, yet are still reported unquestioningly. The probability of having erroneous results increases as a larger number of variables are compared unless preventative measures are taken. The approach taken by the authors is to explain these concepts so managers can ask better questions of their analysts and vendors as to the appropriateness of the methods used to arrive at a conclusion. Because the world of science and medicine has been grappling with similar issues in the publication of studies, the authors draw on their efforts and apply them to big data.

Microsoft Big Data Solutions

Author: Adam Jorgensen
Publisher: John Wiley & Sons
ISBN: 9781118729557
Release Date: 2014-02-24
Genre: Computers

Tap the power of Big Data with Microsoft technologies Big Data is here, and Microsoft's new Big Data platform is a valuable tool to help your company get the very most out of it. This timely book shows you how to use HDInsight along with HortonWorks Data Platform for Windows to store, manage, analyze, and share Big Data throughout the enterprise. Focusing primarily on Microsoft and HortonWorks technologies but also covering open source tools, Microsoft Big Data Solutions explains best practices, covers on-premises and cloud-based solutions, and features valuable case studies. Best of all, it helps you integrate these new solutions with technologies you already know, such as SQL Server and Hadoop. Walks you through how to integrate Big Data solutions in your company using Microsoft's HDInsight Server, HortonWorks Data Platform for Windows, and open source tools Explores both on-premises and cloud-based solutions Shows how to store, manage, analyze, and share Big Data through the enterprise Covers topics such as Microsoft's approach to Big Data, installing and configuring HortonWorks Data Platform for Windows, integrating Big Data with SQL Server, visualizing data with Microsoft and HortonWorks BI tools, and more Helps you build and execute a Big Data plan Includes contributions from the Microsoft and HortonWorks Big Data product teams If you need a detailed roadmap for designing and implementing a fully deployed Big Data solution, you'll want Microsoft Big Data Solutions.

Datenanalyse mit Python

Author: Wes McKinney
Publisher: O'Reilly
ISBN: 9783960102144
Release Date: 2018-10-29
Genre: Computers

Erfahren Sie alles über das Manipulieren, Bereinigen, Verarbeiten und Aufbereiten von Datensätzen mit Python: Aktualisiert auf Python 3.6, zeigt Ihnen dieses konsequent praxisbezogene Buch anhand konkreter Fallbeispiele, wie Sie eine Vielzahl von typischen Datenanalyse-Problemen effektiv lösen. Gleichzeitig lernen Sie die neuesten Versionen von pandas, NumPy, IPython und Jupyter kennen.Geschrieben von Wes McKinney, dem Begründer des pandas-Projekts, bietet Datenanalyse mit Python einen praktischen Einstieg in die Data-Science-Tools von Python. Das Buch eignet sich sowohl für Datenanalysten, für die Python Neuland ist, als auch für Python-Programmierer, die sich in Data Science und Scientific Computing einarbeiten wollen. Daten und zugehöriges Material des Buchs sind auf GitHub verfügbar.Aus dem Inhalt:Nutzen Sie die IPython-Shell und Jupyter Notebook für das explorative ComputingLernen Sie Grundfunktionen und fortgeschrittene Features von NumPy kennenSetzen Sie die Datenanalyse-Tools der pandasBibliothek einVerwenden Sie flexible Werkzeuge zum Laden, Bereinigen, Transformieren, Zusammenführen und Umformen von DatenErstellen Sie interformative Visualisierungen mit matplotlibWenden Sie die GroupBy-Mechanismen von pandas an, um Datensätzen zurechtzuschneiden, umzugestalten und zusammenzufassenAnalysieren und manipulieren Sie verschiedenste Zeitreihen-DatenFür diese aktualisierte 2. Auflage wurde der gesamte Code an Python 3.6 und die neuesten Versionen der pandas-Bibliothek angepasst. Neu in dieser Auflage: Informationen zu fortgeschrittenen pandas-Tools sowie eine kurze Einführung in statsmodels und scikit-learn.

In Memory Datenbank SAP HANA

Author: Peter Preuss
Publisher: Springer-Verlag
ISBN: 9783658186036
Release Date: 2017-06-16
Genre: Business & Economics

Der vorliegende Sammelband enthält Beiträge zum Thema SAP HANA, einer Softwareplattform, die im Kern aus einer In-Memory-Datenbank besteht. Im ersten Beitrag werden die theoretischen Grundlagen gelegt und der aktuelle Markt für In-Memory-Datenbanken untersucht. Anhand eines konkreten Fallbeispiels wird im darauffolgenden Beitrag die Vorgehensweise bei einer SAP-HANA-Migration erläutert. Anschließend werden mögliche Migrationsstrategien für einen Systemwechsel von SAP ERP nach SAP HANA diskutiert. Der vierte Beitrag geht auf den SAP Solution Manager ein, mit dem die Einführung von SAP HANA unterstützt wird. Schließlich werden Backup- und Recovery-Konzepte untersucht, die eine wichtige Rolle bei In-Memory-Datenbanken spielen.​

big data work

Author: Thomas H. Davenport
Publisher: Vahlen
ISBN: 9783800648153
Release Date: 2014-10-15
Genre: Fiction

Big Data in Unternehmen. Dieses neue Buch gibt Managern ein umfassendes Verständnis dafür, welche Bedeutung Big Data für Unternehmen zukünftig haben wird und wie Big Data tatsächlich genutzt werden kann. Am Ende jedes Kapitels aktivieren Fragen, selbst nach Lösungen für eine erfolgreiche Implementierung und Nutzung von Big Data im eigenen Unternehmen zu suchen. Die Schwerpunkte - Warum Big Data für Sie und Ihr Unternehmen wichtig ist - Wie Big Data Ihre Arbeit, Ihr Unternehmen und Ihre Branche verändern - - wird - Entwicklung einer Big Data-Strategie - Der menschliche Aspekt von Big Data - Technologien für Big Data - Wie Sie erfolgreich mit Big Data arbeiten - Was Sie von Start-ups und Online-Unternehmen lernen können - Was Sie von großen Unternehmen lernen können: Big Data und Analytics 3.0 Der Experte Thomas H. Davenport ist Professor für Informationstechnologie und -management am Babson College und Forschungswissenschaftler am MIT Center for Digital Business. Zudem ist er Mitbegründer und Forschungsdirektor am International Institute for Analytics und Senior Berater von Deloitte Analytics.

Harness the Power of Big Data The IBM Big Data Platform

Author: Paul Zikopoulos
Publisher: McGraw Hill Professional
ISBN: 9780071808170
Release Date: 2012-10-18
Genre: Computers

Boost your Big Data IQ! Gain insight into how to govern and consume IBM’s unique in-motion and at-rest Big Data analytic capabilities Big Data represents a new era of computing—an inflection point of opportunity where data in any format may be explored and utilized for breakthrough insights—whether that data is in-place, in-motion, or at-rest. IBM is uniquely positioned to help clients navigate this transformation. This book reveals how IBM is infusing open source Big Data technologies with IBM innovation that manifest in a platform capable of "changing the game." The four defining characteristics of Big Data—volume, variety, velocity, and veracity—are discussed. You’ll understand how IBM is fully committed to Hadoop and integrating it into the enterprise. Hear about how organizations are taking inventories of their existing Big Data assets, with search capabilities that help organizations discover what they could already know, and extend their reach into new data territories for unprecedented model accuracy and discovery. In this book you will also learn not just about the technologies that make up the IBM Big Data platform, but when to leverage its purpose-built engines for analytics on data in-motion and data at-rest. And you’ll gain an understanding of how and when to govern Big Data, and how IBM’s industry-leading InfoSphere integration and governance portfolio helps you understand, govern, and effectively utilize Big Data. Industry use cases are also included in this practical guide.

Active directory

Author: Robbie Allen
Publisher: O'Reilly Verlag DE
ISBN: 3897211734
Release Date: 2004
Genre:


Professional Hadoop Solutions

Author: Boris Lublinsky
Publisher: John Wiley & Sons
ISBN: 9781118824184
Release Date: 2013-09-12
Genre: Computers

The go-to guidebook for deploying Big Data solutions withHadoop Today's enterprise architects need to understand how the Hadoopframeworks and APIs fit together, and how they can be integrated todeliver real-world solutions. This book is a practical, detailedguide to building and implementing those solutions, with code-levelinstruction in the popular Wrox tradition. It covers storing datawith HDFS and Hbase, processing data with MapReduce, and automatingdata processing with Oozie. Hadoop security, running Hadoop withAmazon Web Services, best practices, and automating Hadoopprocesses in real time are also covered in depth. With in-depth code examples in Java and XML and the latest onrecent additions to the Hadoop ecosystem, this complete resourcealso covers the use of APIs, exposing their inner workings andallowing architects and developers to better leverage and customizethem. The ultimate guide for developers, designers, and architectswho need to build and deploy Hadoop applications Covers storing and processing data with various technologies,automating data processing, Hadoop security, and deliveringreal-time solutions Includes detailed, real-world examples and code-levelguidelines Explains when, why, and how to use these tools effectively Written by a team of Hadoop experts in theprogrammer-to-programmer Wrox style Professional Hadoop Solutions is the reference enterprisearchitects and developers need to maximize the power of Hadoop.

Oracle Big Data Handbook

Author: Tom Plunkett
Publisher: McGraw Hill Professional
ISBN: 9780071827263
Release Date: 2013-09-25
Genre: Computers

"Cowritten by members of Oracle's big data team, [this book] provides complete coverage of Oracle's comprehensive, integrated set of products for acquiring, organizing, analyzing, and leveraging unstructured data. The book discusses the strategies and technologies essential for a successful big data implementation, including Apache Hadoop, Oracle Big Data Appliance, Oracle Big Data Connectors, Oracle NoSQL Database, Oracle Endeca, Oracle Advanced Analytics, and Oracle's open source R offerings"--Page 4 of cover.

Smart Data Analytics

Author: Andreas Wierse
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 9783110461916
Release Date: 2017-06-26
Genre: Technology & Engineering

Wenn in Datenbergen wertvolle Geheimnisse schlummern, aus denen Profit erzielt werden soll, dann geht es um Big Data. Doch wie schöpft man aus »großen Daten« echte Werte, wenn man nicht gerade Google ist? Um aus Unternehmens-, Maschinen- oder Sensordaten einen Ertrag zu erzielen, reicht Big Data-Technologie allein nicht aus. Entscheidend sind die übergeordneten Innovations prozesse: die smarte Analyse von Big Data. Erst durch den kompetenten Einsatz der richtigen Werkzeuge und Techniken werden aus Big Data tatsächlich Smart Data. Das Praxishandbuch Smart Data Analytics gibt einen Überblick über die Technologie, die bei der Analyse von großen und heterogenen Datenmengen – inklusive Echtzeitdaten – zum Einsatz kommt. Elf Praxisbeispiele zeigen die konkrete Anwendung in kleinen und mittelständischen Unternehmen. So erfahren Sie, wie Sie Ihr Smart Data Analytics-Projekt in Ihrem eigenen Unternehmen vorbereiten und umsetzen können. Das Buch erläutert neben den organisatorischen Aspekten auch die rechtlichen Rahmenbedingungen. Und es zeigt, wie Sie sowohl den Nutzen bewerten können, der aus den Daten gezogen werden soll, als auch den Aufwand, den Sie dafür betreiben müssen. Denn Smart Data steht für mehr als nur die Untersuchung großer Datenmengen: Smart Data Analytics ist der Schlüssel zu einem smarten Umgang mit Ihren Unternehmensdaten und hilft, bislang unentdecktes Potenzial zu entdecken. Dr. Andreas Wierse studierte Mathematik und promovierte in den Ingenieurwissenschaften im Bereich Visualisierung, seit 2011 unterstützt er mittelständische Unternehmen rund um Big und Smart Data Technologie. Dr. Till Riedel lehrt als Informatiker am KIT und koordiniert im Smart Data Solution Center Baden-Württemberg und Smart Data Innovation Lab Forschung und Innovation auf industriellen Datenschätzen.