Author: Rui Miguel Forte
Publisher: Packt Publishing Ltd
Release Date: 2015-06-17
R offers a free and open source environment that is perfect for both learning and deploying predictive modeling solutions in the real world. With its constantly growing community and plethora of packages, R offers the functionality to deal with a truly vast array of problems. This book is designed to be both a guide and a reference for moving beyond the basics of predictive modeling. The book begins with a dedicated chapter on the language of models and the predictive modeling process. Each subsequent chapter tackles a particular type of model, such as neural networks, and focuses on the three important questions of how the model works, how to use R to train it, and how to measure and assess its performance using real world data sets. By the end of this book, you will have explored and tested the most popular modeling techniques in use on real world data sets and mastered a diverse range of techniques in predictive analytics.
Master machine learning techniques with R to deliver insights in complex projects About This Book Understand and apply machine learning methods using an extensive set of R packages such as XGBOOST Understand the benefits and potential pitfalls of using machine learning methods such as Multi-Class Classification and Unsupervised Learning Implement advanced concepts in machine learning with this example-rich guide Who This Book Is For This book is for data science professionals, data analysts, or anyone with a working knowledge of machine learning, with R who now want to take their skills to the next level and become an expert in the field. What You Will Learn Gain deep insights into the application of machine learning tools in the industry Manipulate data in R efficiently to prepare it for analysis Master the skill of recognizing techniques for effective visualization of data Understand why and how to create test and training data sets for analysis Master fundamental learning methods such as linear and logistic regression Comprehend advanced learning methods such as support vector machines Learn how to use R in a cloud service such as Amazon In Detail This book will teach you advanced techniques in machine learning with the latest code in R 3.3.2. You will delve into statistical learning theory and supervised learning; design efficient algorithms; learn about creating Recommendation Engines; use multi-class classification and deep learning; and more. You will explore, in depth, topics such as data mining, classification, clustering, regression, predictive modeling, anomaly detection, boosted trees with XGBOOST, and more. More than just knowing the outcome, you'll understand how these concepts work and what they do. With a slow learning curve on topics such as neural networks, you will explore deep learning, and more. By the end of this book, you will be able to perform machine learning with R in the cloud using AWS in various scenarios with different datasets. Style and approach The book delivers practical and real-world solutions to problems and a variety of tasks such as complex recommendation systems. By the end of this book, you will have gained expertise in performing R machine learning and will be able to build complex machine learning projects using R and its packages.
Author: Eric Mayor
Publisher: Packt Publishing Ltd
Release Date: 2015-09-24
Get to grips with key data visualization and predictive analytic skills using R About This Book Acquire predictive analytic skills using various tools of R Make predictions about future events by discovering valuable information from data using R Comprehensible guidelines that focus on predictive model design with real-world data Who This Book Is For If you are a statistician, chief information officer, data scientist, ML engineer, ML practitioner, quantitative analyst, and student of machine learning, this is the book for you. You should have basic knowledge of the use of R. Readers without previous experience of programming in R will also be able to use the tools in the book. What You Will Learn Customize R by installing and loading new packages Explore the structure of data using clustering algorithms Turn unstructured text into ordered data, and acquire knowledge from the data Classify your observations using Naive Bayes, k-NN, and decision trees Reduce the dimensionality of your data using principal component analysis Discover association rules using Apriori Understand how statistical distributions can help retrieve information from data using correlations, linear regression, and multilevel regression Use PMML to deploy the models generated in R In Detail R is statistical software that is used for data analysis. There are two main types of learning from data: unsupervised learning, where the structure of data is extracted automatically; and supervised learning, where a labeled part of the data is used to learn the relationship or scores in a target attribute. As important information is often hidden in a lot of data, R helps to extract that information with its many standard and cutting-edge statistical functions. This book is packed with easy-to-follow guidelines that explain the workings of the many key data mining tools of R, which are used to discover knowledge from your data. You will learn how to perform key predictive analytics tasks using R, such as train and test predictive models for classification and regression tasks, score new data sets and so on. All chapters will guide you in acquiring the skills in a practical way. Most chapters also include a theoretical introduction that will sharpen your understanding of the subject matter and invite you to go further. The book familiarizes you with the most common data mining tools of R, such as k-means, hierarchical regression, linear regression, association rules, principal component analysis, multilevel modeling, k-NN, Naive Bayes, decision trees, and text mining. It also provides a description of visualization techniques using the basic visualization tools of R as well as lattice for visualizing patterns in data organized in groups. This book is invaluable for anyone fascinated by the data mining opportunities offered by GNU R and its packages. Style and approach This is a practical book, which analyzes compelling data about life, health, and death with the help of tutorials. It offers you a useful way of interpreting the data that's specific to this book, but that can also be applied to any other data.
Exploit the power of data in your business by building advanced predictive modeling applications with Python About This Book Master open source Python tools to build sophisticated predictive models Learn to identify the right machine learning algorithm for your problem with this forward-thinking guide Grasp the major methods of predictive modeling and move beyond the basics to a deeper level of understanding Who This Book Is For This book is designed for business analysts, BI analysts, data scientists, or junior level data analysts who are ready to move from a conceptual understanding of advanced analytics to an expert in designing and building advanced analytics solutions using Python. You're expected to have basic development experience with Python. What You Will Learn Gain an insight into components and design decisions for an analytical application Master the use Python notebooks for exploratory data analysis and rapid prototyping Get to grips with applying regression, classification, clustering, and deep learning algorithms Discover the advanced methods to analyze structured and unstructured data Find out how to deploy a machine learning model in a production environment Visualize the performance of models and the insights they produce Scale your solutions as your data grows using Python Ensure the robustness of your analytic applications by mastering the best practices of predictive analysis In Detail The volume, diversity, and speed of data available has never been greater. Powerful machine learning methods can unlock the value in this information by finding complex relationships and unanticipated trends. Using the Python programming language, analysts can use these sophisticated methods to build scalable analytic applications to deliver insights that are of tremendous value to their organizations. In Mastering Predictive Analytics with Python, you will learn the process of turning raw data into powerful insights. Through case studies and code examples using popular open-source Python libraries, this book illustrates the complete development process for analytic applications and how to quickly apply these methods to your own data to create robust and scalable prediction services. Covering a wide range of algorithms for classification, regression, clustering, as well as cutting-edge techniques such as deep learning, this book illustrates not only how these methods work, but how to implement them in practice. You will learn to choose the right approach for your problem and how to develop engaging visualizations to bring the insights of predictive modeling to life Style and approach This book emphasizes on explaining methods through example data and code, showing you templates that you can quickly adapt to your own use cases. It focuses on both a practical application of sophisticated algorithms and the intuitive understanding necessary to apply the correct method to the problem at hand. Through visual examples, it also demonstrates how to convey insights through insightful charts and reporting.
Author: Conrad Carlberg
Publisher: Que Publishing
Release Date: 2017-07-24
Genre: Business & Economics
EXCEL 2016 PREDICTIVE ANALYTICS FOR SERIOUS DATA CRUNCHERS! Now, you can apply cutting-edge predictive analytics techniques to help your business win–and you don’t need multimillion-dollar software to do it. All the tools you need are available in Microsoft Excel 2016, and all the knowledge and skills are right here, in this book! Microsoft Excel MVP Conrad Carlberg shows you how to use Excel predictive analytics to solve real problems in areas ranging from sales and marketing to operations. Carlberg offers unprecedented insight into building powerful, credible, and reliable forecasts, helping you gain deep insights from Excel that would be difficult to uncover with costly tools such as SAS or SPSS. Fully updated for Excel 2016, this guide contains valuable new coverage of accounting for seasonality and managing complex consumer choice scenarios. Throughout, Carlberg provides downloadable Excel 2016 workbooks you can easily adapt to your own needs, plus VBA code–much of it open-source–to streamline especially complex techniques. Step by step, you’ll build on Excel skills you already have, learning advanced techniques that can help you increase revenue, reduce costs, and improve productivity. By mastering predictive analytics, you’ll gain a powerful competitive advantage for your company and yourself. Learn the “how” and “why” of using data to make better decisions, and choose the right technique for each problem Capture live real-time data from diverse sources, including third-party websites Use logistic regression to predict behaviors such as “will buy” versus “won’t buy” Distinguish random data bounces from real, fundamental changes Forecast time series with smoothing and regression Account for trends and seasonality via Holt-Winters smoothing Prevent trends from running out of control over long time horizons Construct more accurate predictions by using Solver Manage large numbers of variables and unwieldy datasets with principal components analysis and Varimax factor rotation Apply ARIMA (Box-Jenkins) techniques to build better forecasts and clarify their meaning Handle complex consumer choice problems with advanced logistic regression Benchmark Excel results against R results
This book is about predictive analytics. Yet, each chapter could easily be handled by an entire volume of its own. So one might think of this a survey of predictive modeling. A predictive model is a statistical model or machine learning model used to predict future behavior based on past behavior. In order to use this book, one should have a basic understanding of mathematical statistics - it is an advanced book. Some theoretical foundations are laid out but not proven, but references are provided for additional coverage. Every chapter culminates in an example using R. R is a free software environment for statistical computing and graphics. You may download R, from a preferred CRAN mirror at http: //www.r-project.org/. The book is organized so that statistical models are presented first (hopefully in a logical order), followed by machine learning models, and then applications: uplift modeling and time series. One could use this a textbook with problem solving in R-but there are no "by-hand" exercises.
Master the art of predictive modeling About This Book Load, wrangle, and analyze your data using the world's most powerful statistical programming language Familiarize yourself with the most common data mining tools of R, such as k-means, hierarchical regression, linear regression, Naive Bayes, decision trees, text mining and so on. We emphasize important concepts, such as the bias-variance trade-off and over-fitting, which are pervasive in predictive modeling Who This Book Is For If you work with data and want to become an expert in predictive analysis and modeling, then this Learning Path will serve you well. It is intended for budding and seasoned practitioners of predictive modeling alike. You should have basic knowledge of the use of R, although it's not necessary to put this Learning Path to great use. What You Will Learn Get to know the basics of R's syntax and major data structures Write functions, load data, and install packages Use different data sources in R and know how to interface with databases, and request and load JSON and XML Identify the challenges and apply your knowledge about data analysis in R to imperfect real-world data Predict the future with reasonably simple algorithms Understand key data visualization and predictive analytic skills using R Understand the language of models and the predictive modeling process In Detail Predictive analytics is a field that uses data to build models that predict a future outcome of interest. It can be applied to a range of business strategies and has been a key player in search advertising and recommendation engines. The power and domain-specificity of R allows the user to express complex analytics easily, quickly, and succinctly. R offers a free and open source environment that is perfect for both learning and deploying predictive modeling solutions in the real world. This Learning Path will provide you with all the steps you need to master the art of predictive modeling with R. We start with an introduction to data analysis with R, and then gradually you'll get your feet wet with predictive modeling. You will get to grips with the fundamentals of applied statistics and build on this knowledge to perform sophisticated and powerful analytics. You will be able to solve the difficulties relating to performing data analysis in practice and find solutions to working with “messy data”, large data, communicating results, and facilitating reproducibility. You will then perform key predictive analytics tasks using R, such as train and test predictive models for classification and regression tasks, score new data sets and so on. By the end of this Learning Path, you will have explored and tested the most popular modeling techniques in use on real-world data sets and mastered a diverse range of techniques in predictive analytics. This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products: Data Analysis with R, Tony Fischetti Learning Predictive Analytics with R, Eric Mayor Mastering Predictive Analytics with R, Rui Miguel Forte Style and approach Learn data analysis using engaging examples and fun exercises, and with a gentle and friendly but comprehensive "learn-by-doing" approach. This is a practical course, which analyzes compelling data about life, health, and death with the help of tutorials. It offers you a useful way of interpreting the data that's specific to this course, but that can also be applied to any other data. This course is designed to be both a guide and a reference for moving beyond the basics of predictive modeling.
Author: Thomas W. Miller
Publisher: FT Press
Release Date: 2014-09-29
To succeed with predictive analytics, you must understand it on three levels: Strategy and management Methods and models Technology and code This up-to-the-minute reference thoroughly covers all three categories. Now fully updated, this uniquely accessible book will help you use predictive analytics to solve real business problems and drive real competitive advantage. If you’re new to the discipline, it will give you the strong foundation you need to get accurate, actionable results. If you’re already a modeler, programmer, or manager, it will teach you crucial skills you don’t yet have. Unlike competitive books, this guide illuminates the discipline through realistic vignettes and intuitive data visualizations–not complex math. Thomas W. Miller, leader of Northwestern University’s pioneering program in predictive analytics, guides you through defining problems, identifying data, crafting and optimizing models, writing effective R code, interpreting results, and more. Every chapter focuses on one of today’s key applications for predictive analytics, delivering skills and knowledge to put models to work–and maximize their value. Reflecting extensive student and instructor feedback, this edition adds five classroom-tested case studies, updates all code for new versions of R, explains code behavior more clearly and completely, and covers modern data science methods even more effectively. All data sets, extensive R code, and additional examples available for download at http://www.ftpress.com/miller If you want to make the most of predictive analytics, data science, and big data, this is the book for you. Thomas W. Miller’s unique balanced approach combines business context and quantitative tools, appealing to managers, analysts, programmers, and students alike. Miller addresses multiple business cases and challenges, including segmentation, brand positioning, product choice modeling, pricing research, finance, sports, text analytics, sentiment analysis, and social network analysis. He illuminates the use of cross-sectional data, time series, spatial, and spatio-temporal data. You’ll learn why each problem matters, what data are relevant, and how to explore the data you’ve identified. Miller guides you through conceptually modeling each data set with words and figures; and then modeling it again with realistic R programs that deliver actionable insights. You’ll walk through model construction, explanatory variable subset selection, and validation, mastering best practices for improving out-of-sample predictive performance. Throughout, Miller employs data visualization and statistical graphics to help you explore data, present models, and evaluate performance. This edition adds five new case studies, updates all code for the newest versions of R, adds more commenting to clarify how the code works, and offers a more detailed and up-to-date primer on data science methods. Gain powerful, actionable, profitable insights about: Advertising and promotion Consumer preference and choice Market baskets and related purchases Economic forecasting Operations management Unstructured text and language Customer sentiment Brand and price Sports team performance And much more
Make sense of your data and predict the unpredictable About This Book A unique book that centers around develop six key practical skills needed to develop and implement predictive analytics Apply the principles and techniques of predictive analytics to effectively interpret big data Solve real-world analytical problems with the help of practical case studies and real-world scenarios taken from the world of healthcare, marketing, and other business domains Who This Book Is For This book is for those with a mathematical/statistics background who wish to understand the concepts, techniques, and implementation of predictive analytics to resolve complex analytical issues. Basic familiarity with a programming language of R is expected. What You Will Learn Master the core predictive analytics algorithm which are used today in business Learn to implement the six steps for a successful analytics project Classify the right algorithm for your requirements Use and apply predictive analytics to research problems in healthcare Implement predictive analytics to retain and acquire your customers Use text mining to understand unstructured data Develop models on your own PC or in Spark/Hadoop environments Implement predictive analytics products for customers In Detail This is the go-to book for anyone interested in the steps needed to develop predictive analytics solutions with examples from the world of marketing, healthcare, and retail. We'll get started with a brief history of predictive analytics and learn about different roles and functions people play within a predictive analytics project. Then, we will learn about various ways of installing R along with their pros and cons, combined with a step-by-step installation of RStudio, and a description of the best practices for organizing your projects. On completing the installation, we will begin to acquire the skills necessary to input, clean, and prepare your data for modeling. We will learn the six specific steps needed to implement and successfully deploy a predictive model starting from asking the right questions through model development and ending with deploying your predictive model into production. We will learn why collaboration is important and how agile iterative modeling cycles can increase your chances of developing and deploying the best successful model. We will continue your journey in the cloud by extending your skill set by learning about Databricks and SparkR, which allow you to develop predictive models on vast gigabytes of data. Style and Approach This book takes a practical hands-on approach wherein the algorithms will be explained with the help of real-world use cases. It is written in a well-researched academic style which is a great mix of theoretical and practical information. Code examples are supplied for both theoretical concepts as well as for the case studies. Key references and summaries will be provided at the end of each chapter so that you can explore those topics on their own.
Author: Thomas W. Miller
Publisher: Pearson Education
Release Date: 2014
Genre: Business & Economics
Using Phyton and R, the author addresses multiple business challenge, including segmentation, brand positioning, product choice modeling, pricing research, finance, sprots, text analytics, sentiment analysis and social network analysis, cross sectional data, time series, spatial and spatio-temporal data.
Master machine learning with Python in six steps and explore fundamental to advanced topics, all designed to make you a worthy practitioner. This book’s approach is based on the “Six degrees of separation” theory, which states that everyone and everything is a maximum of six steps away. Mastering Machine Learning with Python in Six Steps presents each topic in two parts: theoretical concepts and practical implementation using suitable Python packages. You’ll learn the fundamentals of Python programming language, machine learning history, evolution, and the system development frameworks. Key data mining/analysis concepts, such as feature dimension reduction, regression, time series forecasting and their efficient implementation in Scikit-learn are also covered. Finally, you’ll explore advanced text mining techniques, neural networks and deep learning techniques, and their implementation. All the code presented in the book will be available in the form of iPython notebooks to enable you to try out these examples and extend them to your advantage. What You'll Learn Examine the fundamentals of Python programming language Review machine Learning history and evolution Understand machine learning system development frameworks Implement supervised/unsupervised/reinforcement learning techniques with examples Explore fundamental to advanced text mining techniques Implement various deep learning frameworks Who This Book Is For Python developers or data engineers looking to expand their knowledge or career into machine learning area. Non-Python (R, SAS, SPSS, Matlab or any other language) machine learning practitioners looking to expand their implementation skills in Python. Novice machine learning practitioners looking to learn advanced topics, such as hyperparameter tuning, various ensemble techniques, natural language processing (NLP), deep learning, and basics of reinforcement learning.
Author: Thomas W. Miller
Publisher: FT Press
Release Date: 2015-05-02
Genre: Business & Economics
Now , a leader of Northwestern University's prestigious analytics program presents a fully-integrated treatment of both the business and academic elements of marketing applications in predictive analytics. Writing for both managers and students, Thomas W. Miller explains essential concepts, principles, and theory in the context of real-world applications. Building on Miller's pioneering program, Marketing Data Science thoroughly addresses segmentation, target marketing, brand and product positioning, new product development, choice modeling, recommender systems, pricing research, retail site selection, demand estimation, sales forecasting, customer retention, and lifetime value analysis. Starting where Miller's widely-praised Modeling Techniques in Predictive Analytics left off, he integrates crucial information and insights that were previously segregated in texts on web analytics, network science, information technology, and programming. Coverage includes: The role of analytics in delivering effective messages on the web Understanding the web by understanding its hidden structures Being recognized on the web – and watching your own competitors Visualizing networks and understanding communities within them Measuring sentiment and making recommendations Leveraging key data science methods: databases/data preparation, classical/Bayesian statistics, regression/classification, machine learning, and text analytics Six complete case studies address exceptionally relevant issues such as: separating legitimate email from spam; identifying legally-relevant information for lawsuit discovery; gleaning insights from anonymous web surfing data, and more. This text's extensive set of web and network problems draw on rich public-domain data sources; many are accompanied by solutions in Python and/or R. Marketing Data Science will be an invaluable resource for all students, faculty, and professional marketers who want to use business analytics to improve marketing performance.
Learn the fundamental aspects of the business statistics, data mining, and machine learning techniques required to understand the huge amount of data generated by your organization. This book explains practical business analytics through examples, covers the steps involved in using it correctly, and shows you the context in which a particular technique does not make sense. Further, Practical Business Analytics using R helps you understand specific issues faced by organizations and how the solutions to these issues can be facilitated by business analytics. This book will discuss and explore the following through examples and case studies: An introduction to R: data management and R functions The architecture, framework, and life cycle of a business analytics project Descriptive analytics using R: descriptive statistics and data cleaning Data mining: classification, association rules, and clustering Predictive analytics: simple regression, multiple regression, and logistic regression This book includes case studies on important business analytic techniques, such as classification, association, clustering, and regression. The R language is the statistical tool used to demonstrate the concepts throughout the book. What You Will Learn • Write R programs to handle data • Build analytical models and draw useful inferences from them • Discover the basic concepts of data mining and machine learning • Carry out predictive modeling • Define a business issue as an analytical problem Who This Book Is For Beginners who want to understand and learn the fundamentals of analytics using R. Students, managers, executives, strategy and planning professionals, software professionals, and BI/DW professionals.
Mine valuable insights from your data using popular tools and techniques in R About This Book Understand the basics of data mining and why R is a perfect tool for it. Manipulate your data using popular R packages such as ggplot2, dplyr, and so on to gather valuable business insights from it. Apply effective data mining models to perform regression and classification tasks. Who This Book Is For If you are a budding data scientist, or a data analyst with a basic knowledge of R, and want to get into the intricacies of data mining in a practical manner, this is the book for you. No previous experience of data mining is required. What You Will Learn Master relevant packages such as dplyr, ggplot2 and so on for data mining Learn how to effectively organize a data mining project through the CRISP-DM methodology Implement data cleaning and validation tasks to get your data ready for data mining activities Execute Exploratory Data Analysis both the numerical and the graphical way Develop simple and multiple regression models along with logistic regression Apply basic ensemble learning techniques to join together results from different data mining models Perform text mining analysis from unstructured pdf files and textual data Produce reports to effectively communicate objectives, methods, and insights of your analyses In Detail R is widely used to leverage data mining techniques across many different industries, including finance, medicine, scientific research, and more. This book will empower you to produce and present impressive analyses from data, by selecting and implementing the appropriate data mining techniques in R. It will let you gain these powerful skills while immersing in a one of a kind data mining crime case, where you will be requested to help resolving a real fraud case affecting a commercial company, by the mean of both basic and advanced data mining techniques. While moving along the plot of the story you will effectively learn and practice on real data the various R packages commonly employed for this kind of tasks. You will also get the chance of apply some of the most popular and effective data mining models and algos, from the basic multiple linear regression to the most advanced Support Vector Machines. Unlike other data mining learning instruments, this book will effectively expose you the theory behind these models, their relevant assumptions and when they can be applied to the data you are facing. By the end of the book you will hold a new and powerful toolbox of instruments, exactly knowing when and how to employ each of them to solve your data mining problems and get the most out of your data. Finally, to let you maximize the exposure to the concepts described and the learning process, the book comes packed with a reproducible bundle of commented R scripts and a practical set of data mining models cheat sheets. Style and approach This book takes a practical, step-by-step approach to explain the concepts of data mining. Practical use-cases involving real-world datasets are used throughout the book to clearly explain theoretical concepts.