Detailansicht

Data Mining and Business Analytics with R

eBook

Ledolter, Johannes

WILEY

ISBN/EAN: 9781118572153

Umbreit-Nr.: 4512562

Sprache: Englisch

Umfang: 368 S., 10.97 MB

Format in cm:

Einband: Keine Angabe

Erschienen am 28.05.2013

Auflage: 1/2013

E-Book
Format: EPUB
DRM: Adobe DRM

€ 103,99

(inklusive MwSt.)

Sofort Lieferbar

Beim Buchhandel bestellen

Zusatztext
- Collecting, analyzing, and extracting valuable information from a large amount of data requires easily accessible, robust, computational and analytical tools.Data Mining and Business Analytics with R utilizes the open source software R for the analysis, exploration, and simplification of large high-dimensional data sets. As a result, readers are provided with the needed guidance to model and interpret complicated data and become adept at building powerful models for prediction and classification.Highlighting both underlying concepts and practical computational skills,Data Mining and Business Analytics with R begins with coverage of standard linear regression and the importance of parsimony in statistical modeling. The book includes important topics such as penalty-based variable selection (LASSO); logistic regression; regression and classification trees; clustering; principal components and partial least squares; and the analysis of text and network data. In addition, the book presents:<ul><li>A thorough discussion and extensive demonstration of the theory behind the most useful data mining tools</li><li>Illustrations of how to use the outlined concepts in real-world situations</li><li>Readily available additional data sets and related R code allowing readers to apply their own analyses to the discussed materials</li><li>Numerous exercises to help readers with computing skills and deepen their understanding of the material</li></ul>Data Mining and Business Analytics with R is an excellent graduate-level textbook for courses on data mining and business analytics. The book is also a valuable reference for practitioners who collect and analyze data in the fields of finance, operations management, marketing, and the information sciences.
Kurztext
- InhaltsangabePrefaceAcknowledgements1. IntroductionReference2. Processing the Information and Getting to Know Your Data2.1 Example 1: 2006 Birth Data2.2 Example 2: Alumni Donations2.3 Example 3: Orange JuiceReferences3. Standard Linear Regression3.1 Example 1: Fuel Efficiency of Automobiles3.2 Example 2: Toyota Used Car PricesAppendix: The Effects of Model Over fitting on the Average Mean Square Error of the Regression PredictionReferences4. Local Polynomial Regression: A Nonparametric Regression Approach4.1 Example 1: Old Faithful4.2 Example 2: NOx Exhaust EmissionsReferences5. Importance of Parsimony in Statistical ModelingReferences6. Penalty-Based Variable Selection in Regression Models with Many Parameters (LASSO)6.1 Example 1: Prostate Cancer6.2 Example 2: Orange Juice References7. Logistic Regression7.1 Example 1: Death Penalty Data7.2 Example 2: Delayed Airplanes7.3 Example 3: Loan Acceptance7.4 Example 4: German Credit DataReferences8. Binary Classification, Probabilities and Evaluating Classification Performance8.1 Example: German Credit DataReferences9. Classification Using a Nearest Neighbor Analysis9.1 Example 1: Forensic Glass9.2 Example 2: German Credit Data10. The Naïve Bayesian Analysis: A Model for Predicting a Categorical Response from Mostly Categorical Predictor Variables10.1 Example: Delayed AirplanesReference11. Multinomial Logistic Regression11.1 Example 1: Forensic Glass11.2 Example 2: Forensic Glass RevisitedAppendix: Specification of a Simple Triplet MatrixReferences12. More on Classification and a Discussion of Discriminant Analysis 12.1 Example 1: German Credit Data12.2 Example 2: Fisher Iris Data12.3 Example 3: Forensic Glass Data12.4 Example 4: MBA Admission DataReference13. Decision Trees13.1Example 1: Prostate Cancer13.2 Example 2: Motorcycle Acceleration13.3 Example 3: Fisher Iris Data Revisited14. Further Discussion on Regression and Classification Trees, Computer Software, and Other Useful Classification Methods14.1 R Packages for Tree Construction14.2 CHAID14.3Ensemble Methods: Bagging, Boosting, and Random Forests14.4Support Vector Machines (SVM)14.5Neural Networks14.6The R Package rattle: A Useful Graphical User Interface for Data MiningReferences15. Clustering15.1 k-means Clustering15.2Another Way to Look at Clustering: Applying the Expectation Maximization (EM) Algorithm to Mixtures of Normal Distributions15.3 Hierarchical Clustering ProceduresReferences16. Market Basket Analysis: Association Rules and Lift16.1 Example 1: Estimate of “Slant” and Partial Least SquaresReferences20. Analysis of Network Data20.1 Example 1: Marriage and Power in 15th Century Florence20.2 Example 2: Connections in a Friendship NetworkReferencesAppendix: ExercisesExercises 1Exercises 2Exercises 3Exercises 4Exercise 5Exercise 6Exercises 7Appendix: ReferencesIndex
Autorenportrait
- JOHANNES LEDOLTER, PhD, is Professor in both the Department of Management Sciences and the Department of Statistics and Actuarial Science at the University of Iowa. He is a Fellow of the American Statistical Association and the American Society for Quality, and an Elected Member of the International Statistical Institute. Dr. Ledolter is the coauthor ofStatistical Methods for Forecasting, Achieving Quality Through Continual Improvement, andStatistical Quality Control: Strategies and Tools for Continual Improvement, all published by Wiley.

Detailansicht

Data Mining and Business Analytics with R

Unternehmen

Handel

Verlage

Logistik

E-Commerce

Kontakt

Newsroom