This book is written for researchers in the social sciences and marketing, especially those with little or no knowledge of computer programming. Data analytics has become part and parcel of today's fast-paced, technology-driven world. We have powerful tools and software that allow us to analyse data available in various formats. However, most of the popular paid software and packages for data analysis are unaffordable, or even inaccessible, for students and researchers. The same is true of many NGOs and agencies that are involved in community-based research in developing countries. Popular open-source platforms and tools such as R and Python are available for data analysis. This book uses Python because of its simplicity, adaptability, broader scope and greater potential in advanced data mining and text mining contexts. We saw a need to educate and train researchers from social science and marketing research backgrounds so that they can use Python, a promising tool, to meet data analysis needs ranging from the simple to the extremely complex, free of cost. What readers learn from this book will not only help them with their conventional data analyses but also enable them to pursue advanced knowledge in machine learning algorithms, text analytics and other new-generation techniques with the support of freely accessible open-source platforms. Since the objective of the book is to educate researchers with no programming background, we have made every effort to give hands-on experience in basic Python coding, which is sufficient for readers to follow the book. The step-by-step procedures for the various data processing and analysis tasks described in this book will make it easy for users. In addition, we have tried our best to explain specific pieces of code and how they produce the desired output.
We also request that you share your valuable comments and suggestions on the book via our blog, so that we can improve it in upcoming volumes. We commit ourselves to answering readers' questions about the code and analyses provided in this book. The book deals specifically with data sets in row-and-column format, the format most commonly used in social science research and the one most researchers are familiar with. We therefore do not work with arrays and dictionaries, except on one or two occasions (only to make you familiar with them), and instead prefer Excel data and the pandas data frame. The book consists of thirteen chapters. Chapter 1 gives an introduction to Python and its relevance and scope in contemporary data analysis contexts. Chapter 2 teaches the basics of Python coding. Chapters 3-7 provide a step-by-step account of how to enter data, process it, carry out preliminary analysis and clean data with the help of Python. Chapters 8-9 present data visualization and narration techniques using Python. Chapter 10 demonstrates how Python can be used for statistical analysis. The remaining chapters focus on more real-life situations in data analysis and practical solutions for handling them. The exercises provided in the book resemble real analysis situations, which will help the reader make an easy transition to data analyst jobs. The authors have taken the utmost care in identifying and providing solutions to all the practical difficulties readers may face while using Python for data analysis. The authors have also developed a series of code snippets and incorporated them to make data processing and analysis convenient and easy for researchers. The self-learning materials given in this book will help social science and marketing researchers deepen their understanding of the various steps in data processing and analysis and gain advanced skills in using Python for this purpose.
Learn how to perform data analysis with the R language and software environment, even if you have little or no programming experience. With the tutorials in this hands-on guide, you’ll learn how to use the essential R tools you need to know to analyze data, including data types and programming concepts. The second half of Learning R shows you real data analysis in action by covering everything from importing data to publishing your results. Each chapter in the book includes a quiz on what you’ve learned, and concludes with exercises, most of which involve writing R code. Write a simple R program, and discover what the language can do Use data types such as vectors, arrays, lists, data frames, and strings Execute code conditionally or repeatedly with branches and loops Apply R add-on packages, and package your own work for others Learn how to clean data you import from a variety of sources Understand data through visualization and summary statistics Use statistical models to pass quantitative judgments about data and make predictions Learn what to do when things go wrong while writing data analysis code
Author: Thomas W. Miller
Publisher: FT Press
Release Date: 2014-09-29
Master predictive analytics, from start to finish Start with strategy and management Master methods and build models Transform your models into highly-effective code—in both Python and R This one-of-a-kind book will help you use predictive analytics, Python, and R to solve real business problems and drive real competitive advantage. You’ll master predictive analytics through realistic case studies, intuitive data visualizations, and up-to-date code for both Python and R—not complex math. Step by step, you’ll walk through defining problems, identifying data, crafting and optimizing models, writing effective Python and R code, interpreting results, and more. Each chapter focuses on one of today’s key applications for predictive analytics, delivering skills and knowledge to put models to work—and maximize their value. Thomas W. Miller, leader of Northwestern University’s pioneering program in predictive analytics, addresses everything you need to succeed: strategy and management, methods and models, and technology and code. If you’re new to predictive analytics, you’ll gain a strong foundation for achieving accurate, actionable results. If you’re already working in the field, you’ll master powerful new skills. If you’re familiar with either Python or R, you’ll discover how these languages complement each other, enabling you to do even more. All data sets, extensive Python and R code, and additional examples available for download at http://www.ftpress.com/miller/ Python and R offer immense power in predictive analytics, data science, and big data. This book will help you leverage that power to solve real business problems, and drive real competitive advantage. Thomas W. Miller’s unique balanced approach combines business context and quantitative tools, illuminating each technique with carefully explained code for the latest versions of Python and R. If you’re new to predictive analytics, Miller gives you a strong foundation for achieving accurate, actionable results. 
If you’re already a modeler, programmer, or manager, you’ll learn crucial skills you don’t already have. Using Python and R, Miller addresses multiple business challenges, including segmentation, brand positioning, product choice modeling, pricing research, finance, sports, text analytics, sentiment analysis, and social network analysis. He illuminates the use of cross-sectional data, time series, spatial, and spatio-temporal data. You’ll learn why each problem matters, what data are relevant, and how to explore the data you’ve identified. Miller guides you through conceptually modeling each data set with words and figures; and then modeling it again with realistic code that delivers actionable insights. You’ll walk through model construction, explanatory variable subset selection, and validation, mastering best practices for improving out-of-sample predictive performance. Miller employs data visualization and statistical graphics to help you explore data, present models, and evaluate performance. Appendices include five complete case studies, and a detailed primer on modern data science methods. Use Python and R to gain powerful, actionable, profitable insights about: Advertising and promotion Consumer preference and choice Market baskets and related purchases Economic forecasting Operations management Unstructured text and language Customer sentiment Brand and price Sports team performance And much more
Analyzing Social Media Networks with NodeXL: Insights from a Connected World, Second Edition, provides readers with a thorough, practical and updated guide to NodeXL, the open-source social network analysis (SNA) plug-in for use with Excel. The book analyzes social media, provides a NodeXL tutorial, and presents network analysis case studies, all of which are revised to reflect the latest developments. Sections cover history and concepts, mapping and modeling, the detailed operation of NodeXL, and case studies, including e-mail, Twitter, Facebook, Flickr and YouTube. In addition, there are descriptions of each system and types of analysis for identifying people, documents, groups and events. This book is perfect for use as a course text in social network analysis or as a guide for practicing NodeXL users. Walks users through NodeXL while also explaining the theory and development behind each step Demonstrates how visual analytics research can be applied to SNA tools for the mass market Includes updated case studies from researchers who use NodeXL on popular networks like email, Facebook, Twitter, and Instagram Includes downloadable companion materials and online resources at https://www.smrfoundation.org/nodexl/teaching-with-nodexl/teaching-resources/
Author: Thomas W. Miller
Publisher: FT Press
Release Date: 2015-05-02
Genre: Business & Economics
Now, a leader of Northwestern University's prestigious analytics program presents a fully-integrated treatment of both the business and academic elements of marketing applications in predictive analytics. Writing for both managers and students, Thomas W. Miller explains essential concepts, principles, and theory in the context of real-world applications. Building on Miller's pioneering program, Marketing Data Science thoroughly addresses segmentation, target marketing, brand and product positioning, new product development, choice modeling, recommender systems, pricing research, retail site selection, demand estimation, sales forecasting, customer retention, and lifetime value analysis. Starting where Miller's widely-praised Modeling Techniques in Predictive Analytics left off, he integrates crucial information and insights that were previously segregated in texts on web analytics, network science, information technology, and programming. Coverage includes: The role of analytics in delivering effective messages on the web Understanding the web by understanding its hidden structures Being recognized on the web – and watching your own competitors Visualizing networks and understanding communities within them Measuring sentiment and making recommendations Leveraging key data science methods: databases/data preparation, classical/Bayesian statistics, regression/classification, machine learning, and text analytics Six complete case studies address exceptionally relevant issues such as: separating legitimate email from spam; identifying legally-relevant information for lawsuit discovery; gleaning insights from anonymous web surfing data, and more. This text's extensive set of web and network problems draws on rich public-domain data sources; many are accompanied by solutions in Python and/or R. Marketing Data Science will be an invaluable resource for all students, faculty, and professional marketers who want to use business analytics to improve marketing performance.
Learn to solve challenging data science problems by building powerful machine learning models using Python About This Book Understand which algorithms to use in a given context with the help of this exciting recipe-based guide This practical tutorial tackles real-world computing problems through a rigorous and effective approach Build state-of-the-art models and develop personalized recommendations to perform machine learning at scale Who This Book Is For This Learning Path is for Python programmers who are looking to use machine learning algorithms to create real-world applications. It is ideal for Python professionals who want to work with large and complex datasets and Python developers and analysts or data scientists who are looking to add to their existing skills by accessing some of the most powerful recent trends in data science. Experience with Python, Jupyter Notebooks, and command-line execution together with a good level of mathematical knowledge to understand the concepts is expected. Machine learning basic knowledge is also expected. What You Will Learn Use predictive modeling and apply it to real-world problems Understand how to perform market segmentation using unsupervised learning Apply your new-found skills to solve real problems, through clearly-explained code for every technique and test Compete with top data scientists by gaining a practical and theoretical understanding of cutting-edge deep learning algorithms Increase predictive accuracy with deep learning and scalable data-handling techniques Work with modern state-of-the-art large-scale machine learning techniques Learn to use Python code to implement a range of machine learning algorithms and techniques In Detail Machine learning is increasingly spreading in the modern data-driven world. It is used extensively across many fields such as search engines, robotics, self-driving cars, and more. Machine learning is transforming the way we understand and interact with the world around us. 
In the first module, Python Machine Learning Cookbook, you will learn how to perform various machine learning tasks using a wide variety of machine learning algorithms to solve real-world problems and use Python to implement these algorithms. The second module, Advanced Machine Learning with Python, is designed to take you on a guided tour of the most relevant and powerful machine learning techniques and you'll acquire a broad set of powerful skills in the area of feature selection and feature engineering. The third module in this learning path, Large Scale Machine Learning with Python, dives into scalable machine learning and the three forms of scalability. It covers the most effective machine learning techniques on a map reduce framework in Hadoop and Spark in Python. This Learning Path will teach you Python machine learning for the real world. The machine learning techniques covered in this Learning Path are at the forefront of commercial practice. This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products: Python Machine Learning Cookbook by Prateek Joshi Advanced Machine Learning with Python by John Hearty Large Scale Machine Learning with Python by Bastiaan Sjardin, Alberto Boschetti, Luca Massaron Style and approach This course is a smooth learning path that will teach you how to get started with Python machine learning for the real world, and develop solutions to real-world problems. Through this comprehensive course, you'll learn to create the most effective machine learning techniques from scratch and more!
If you are an aspiring data scientist and you have at least a working knowledge of data analysis and Python, this book will get you started in data science. Data analysts with experience of R or MATLAB will also find the book to be a comprehensive reference to enhance their data manipulation and machine learning skills.
Over 60 practical recipes on data exploration and analysis About This Book Clean dirty data, extract accurate information, and explore the relationships between variables Forecast the output of an electric plant and the water flow of American rivers using pandas, NumPy, Statsmodels, and scikit-learn Find and extract the most important features from your dataset using the most efficient Python libraries Who This Book Is For If you are a beginner or intermediate-level professional who is looking to solve your day-to-day, analytical problems with Python, this book is for you. Even with no prior programming and data analytics experience, you will be able to finish each recipe and learn while doing so. What You Will Learn Read, clean, transform, and store your data using pandas and OpenRefine Understand your data and explore the relationships between variables using pandas and D3.js Explore a variety of techniques to classify and cluster outbound marketing campaign calls data of a bank using pandas, mlpy, NumPy, and Statsmodels Reduce the dimensionality of your dataset and extract the most important features with pandas, NumPy, and mlpy Predict the output of a power plant with regression models and forecast water flow of American rivers with time series methods using pandas, NumPy, Statsmodels, and scikit-learn Explore social interactions and identify fraudulent activities with graph theory concepts using NetworkX and Gephi Scrape Internet web pages using urllib and BeautifulSoup and get to know natural language processing techniques to classify movie ratings using NLTK Study simulation techniques in an example of a gas station with agent-based modeling In Detail Data analysis is the process of systematically applying statistical and logical techniques to describe and illustrate, condense and recap, and evaluate data. Its importance has been most visible in the sector of information and communication technologies. It is an asset for employees in almost all sectors of the economy.
This book provides a rich set of independent recipes that dive into the world of data analytics and modeling using a variety of approaches, tools, and algorithms. You will learn the basics of data handling and modeling, and will build your skills gradually toward more advanced topics such as simulations, raw text processing, social interactions analysis, and more. First, you will learn some easy-to-follow practical techniques on how to read, write, clean, reformat, explore, and understand your data, arguably the most time-consuming (and the most important) tasks for any data scientist. In the second section, different independent recipes delve into intermediate topics such as classification, clustering, predicting, and more. With the help of these easy-to-follow recipes, you will also learn techniques that can easily be expanded to solve other real-life problems such as building recommendation engines or predictive models. In the third section, you will explore more advanced topics, from graph theory through natural language processing and discrete choice modeling to simulations. You will also get to expand your knowledge on identifying fraud origin with the help of a graph, scrape Internet websites, and classify movies based on their reviews. By the end of this book, you will be able to efficiently use the vast array of tools that the Python environment has to offer. Style and approach This hands-on recipe guide is divided into three sections that tackle and overcome real-world data modeling problems faced by data analysts and scientists in their everyday work. Each independent recipe is written in an easy-to-follow and step-by-step fashion.
Tap into the realm of social media and unleash the power of analytics for data-driven insights using R About This Book A practical guide written to help leverage the power of the R ecosystem to extract, process, analyze, visualize and model social media data Learn about data access, retrieval, cleaning, and curation methods for data originating from various social media platforms. Visualize and analyze data from social media platforms to understand and model complex relationships using various concepts and techniques such as Sentiment Analysis, Topic Modeling, Text Summarization, Recommendation Systems, Social Network Analysis, Classification, and Clustering. Who This Book Is For It is targeted at IT professionals, Data Scientists, Analysts, Developers, Machine Learning Enthusiasts, social media marketers and anyone with a keen interest in data, analytics, and generating insights from social data. Some background experience in R would be helpful, but not necessary, since this book is written keeping in mind that readers can have varying levels of expertise.
What You Will Learn Learn how to tap into data from diverse social media platforms using the R ecosystem Use social media data to formulate and solve real-world problems Analyze user social networks and communities using concepts from graph theory and network analysis Learn to detect opinion and sentiment, extract themes, topics, and trends from unstructured noisy text data from diverse social media channels Understand the art of representing actionable insights with effective visualizations Analyze data from major social media channels such as Twitter, Facebook, Flickr, Foursquare, Github, StackExchange, and so on Learn to leverage popular R packages such as ggplot2, topicmodels, caret, e1071, tm, wordcloud, twitteR, Rfacebook, dplyr, reshape2, and many more In Detail The Internet has truly become humongous, especially with the rise of various forms of social media in the last decade, which give users a platform to express themselves and also communicate and collaborate with each other. This book will help the reader to understand the current social media landscape and to learn how analytics can be leveraged to derive insights from it. This data can be analyzed to gain valuable insights into the behavior and engagement of users, organizations, businesses, and brands. It will help readers frame business problems and solve them using social data. The book will also cover several practical real-world use cases on social media using R and its advanced packages to utilize data science methodologies such as sentiment analysis, topic modeling, text summarization, recommendation systems, social network analysis, classification, and clustering. This will enable readers to learn different hands-on approaches to obtain data from diverse social media sources such as Twitter and Facebook. It will also show readers how to establish detailed workflows to process, visualize, and analyze data to transform social data into actionable insights.
Style and approach This book follows a step-by-step approach with detailed strategies for understanding, extracting, analyzing, visualizing, and modeling data from several major social network platforms such as Facebook, Twitter, Foursquare, Flickr, Github, and StackExchange. The chapters cover several real-world use cases and leverage data science, machine learning, network analysis, and graph theory concepts along with the R ecosystem, including popular packages such as ggplot2, caret, dplyr, topicmodels, tm, and so on.
Develop real-world applications powered by the latest advances in intelligent systems Key Features Gain real-world contextualization using deep learning problems concerning research and application Get to know the best practices to improve and optimize your machine learning systems and algorithms Design and implement machine intelligence using real-world AI-based examples Book Description This Learning Path offers the practical knowledge and techniques you need to create and contribute to machine learning, deep learning, and modern data analysis. You will be introduced to various machine learning and deep learning algorithms from scratch, and shown how to apply them to practical industry challenges using realistic and interesting examples. You will learn to build powerful, robust, and accurate predictive models with the power of TensorFlow, combined with other open-source Python libraries. Throughout the Learning Path, you’ll learn how to develop deep learning applications for machine learning systems. Discover how to attain deep learning programming on GPU in a distributed way. By the end of this Learning Path, you will know the fundamentals of AI and will have worked through a number of case studies that will help you apply your skills to real-world projects.
This Learning Path includes content from the following Packt products: Artificial Intelligence By Example by Denis Rothman Python Deep Learning Projects by Matthew Lamons, Rahul Kumar, and Abhishek Nagaraja Hands-On Artificial Intelligence with TensorFlow by Amir Ziai, Ankit Dixit What you will learn Use adaptive thinking to solve real-life AI case studies Rise beyond being a modern-day factory code worker Understand future AI solutions and adapt quickly to them Master deep neural network implementation using TensorFlow Predict continuous target outcomes using regression analysis Dive deep into textual and social media data using sentiment analysis Who this book is for This Learning Path is for anyone who wants to understand the fundamentals of Artificial Intelligence and implement it practically by devising smart solutions. You will learn to extend your machine learning and deep learning knowledge by creating practical AI smart solutions. Prior experience with Python and statistical knowledge is essential to make the most out of this Learning Path.
For more than 40 years, Computerworld has been the leading source of technology news and information for IT influencers worldwide. Computerworld's award-winning Web site (Computerworld.com), twice-monthly publication, focused conference series and custom research form the hub of the world's largest global IT media network.
Author: Richard Brath
Publisher: John Wiley & Sons
Release Date: 2015-01-30
Wring more out of the data with a scientific approach to analysis Graph Analysis and Visualization brings graph theory out of the lab and into the real world. Using sophisticated methods and tools that span analysis functions, this guide shows you how to exploit graph and network analytic techniques to enable the discovery of new business insights and opportunities. Published in full color, the book describes the process of creating powerful visualizations using a rich and engaging set of examples from sports, finance, marketing, security, social media, and more. You will find practical guidance toward pattern identification and using various data sources, including Big Data, plus clear instruction on the use of software and programming. The companion website offers data sets, full code examples in Python, and links to all the tools covered in the book. Science has already reaped the benefit of network and graph theory, which has powered breakthroughs in physics, economics, genetics, and more. This book brings those proven techniques into the world of business, finance, strategy, and design, helping extract more information from data and better communicate the results to decision-makers. Study graphical examples of networks using clear and insightful visualizations Analyze specifically-curated, easy-to-use data sets from various industries Learn the software tools and programming languages that extract insights from data Code examples using the popular Python programming language There is a tremendous body of scientific work on network and graph theory, but very little of it directly applies to analyst functions outside of the core sciences – until now. Written for those seeking empirically based, systematic analysis methods and powerful tools that apply outside the lab, Graph Analysis and Visualization is a thorough, authoritative resource.
Author: Nicholas J. Radcliffe
Publisher: "O'Reilly Media, Inc."
Release Date: 2012-02-23
About the Book Recent industry surveys expect the cloud computing services market to be in excess of $20 billion and cloud computing jobs to be in excess of 10 million worldwide in 2014 alone. In addition, since a majority of existing information technology (IT) jobs is focused on maintaining legacy in-house systems, the demand for these kinds of jobs is likely to drop rapidly if cloud computing continues to take hold of the industry. However, there are very few educational options available in the area of cloud computing beyond vendor-specific training by cloud providers themselves. Cloud computing courses have not found their way (yet) into mainstream college curricula. This book is written as a textbook on cloud computing for educational programs at colleges. It can also be used by cloud service providers who may be interested in offering a broader perspective of cloud computing to accompany their own customer and employee training programs. The typical reader is expected to have completed a couple of courses in programming using traditional high-level languages at the college-level, and is either a senior or a beginning graduate student in one of the science, technology, engineering or mathematics (STEM) fields. We have tried to write a comprehensive book that transfers knowledge through an immersive "hands-on approach", where the reader is provided the necessary guidance and knowledge to develop working code for real-world cloud applications. Additional support is available at the book's website: www.cloudcomputingbook.info Organization The book is organized into three main parts. Part I covers technologies that form the foundations of cloud computing. These include topics such as virtualization, load balancing, scalability & elasticity, deployment, and replication. Part II introduces the reader to the design & programming aspects of cloud computing. 
Case studies on design and implementation of several cloud applications in the areas such as image processing, live streaming and social networks analytics are provided. Part III introduces the reader to specialized aspects of cloud computing including cloud application benchmarking, cloud security, multimedia applications and big data analytics. Case studies in areas such as IT, healthcare, transportation, networking and education are provided.