The latest edition of this book delves deep into advanced analytics, focusing on enhancing Python and R proficiency within Power BI. New chapters cover optimizing Python and R settings, utilizing Intel's Math Kernel Library (MKL) for performance boosts, and addressing integration challenges. Techniques for managing large datasets beyond available RAM, employing the Parquet data format, and advanced fuzzy matching algorithms are explored. Additionally, it discusses leveraging SQL Server Language Extensions to overcome traditional Python and R limitations in Power BI. It also helps in crafting sophisticated visualizations using the Grammar of Graphics in both R and Python.
This Power BI book will help you master data validation with regular expressions, import data from diverse sources, and apply advanced algorithms for transformation. You'll learn how to safeguard personal data in Power BI with techniques like pseudonymization, anonymization, and data masking. You'll also get to grips with the key statistical features of datasets by plotting multiple visual graphs in the process of building a machine learning model. The book will guide you on utilizing external APIs for enrichment, enhancing I/O performance, and leveraging Python and R for analysis.
You'll reinforce your learning with questions at the end of each chapter.
Configure optimal integration of Python and R with Power BI
Perform complex data manipulations not possible by default in Power BI
Boost Power BI logging and loading large datasets
Extract insights from your data using algorithms like linear optimization
Calculate string distances and learn how to use them for probabilistic fuzzy matching
Handle outliers and missing values for multivariate and time-series data
Apply Exploratory Data Analysis in Power BI with R
Learn to use Grammar of Graphics in Python