InTDS ArchivebyBenjamin BodnerDeep Learning vs Data Science: Who Will Win?What is more important, your data or your model?Oct 22, 202438Oct 22, 202438
InTDS ArchivebyW Brett KennedyFormulaFeatures: A Tool to Generate Highly Predictive Features for Interpretable ModelsCreate more interpretable models by using concise, highly predictive features, automatically engineered based on arithmetic combinations of…Oct 6, 20242Oct 6, 20242
InTDS ArchivebyHadenCyclical Encoding: An Alternative to One-Hot Encoding for Time Series FeaturesCyclical encoding provides your model with the same information using significantly fewer featuresMay 3, 20243May 3, 20243
InThe TechlifebyFabiana ClementeThe 1 Python Package to Profile your Spark dataframesThe great debut of ydata-profiling into the big data landscapeFeb 17, 20231Feb 17, 20231
InPython in Plain EnglishbyNick HemenwayMy Favorite Way to Smooth Noisy Data With PythonNearly all real-world data is noisy. What do I mean by noisy? Consider the following simple example: I’ve got a mass attached to a spring —…Dec 4, 20226Dec 4, 20226
InTDS ArchivebyRob Taylor, PhDMulticollinearity: Problem, or Not?A brief guide on multicollinearity and how it affects multiple regression modelsJun 24, 20225Jun 24, 20225
InTowards AIbyMohneesh SSavitzky-Golay Filter for data SmoothingMost underrated technique while Data PreprocessingJan 4, 20222Jan 4, 20222
Raphael SchoenenbergerEncoding Temporal Features (Part 1)How to teach public holidays to Deep Neural Networks (DNN)Mar 23, 2022Mar 23, 2022
InTDS ArchivebyMark DerdzinskiFive reasons feature engineering is key to maximizing data science impactA framework for evaluating data products and building high-impact teamsJul 28, 2022Jul 28, 2022
InTDS ArchivebyEryk LewinsonThree Approaches to Feature Engineering for Time SeriesUsing dummy variables, cyclical encoding, and radial basis functionsJul 29, 20225Jul 29, 20225
Gianluca MalatoWhy You Shouldn’t Use PCA in a Supervised Machine Learning ProjectSome flaws of Principal Component Analysis that affect supervised machine learning projectsJul 10, 20228Jul 10, 20228
InTDS ArchivebyIndraneel Dutta BaruahDeep-dive on ML techniques for feature selection in Python — Part 2The second part of a series on ML-based feature selection where we discuss popular embedded and wrapper methods like Lasso regression…Jul 10, 2022Jul 10, 2022
Armand SauzaySHAP values: Machine Learning interpretability and feature selection made easy.Machine learning interpretability with hands on code with SHAP.Jun 26, 20222Jun 26, 20222
Brenda LoznikPump it up — How to build a high-ranking modelAfter the hyperparameter tuning step, I combined the best performing models in an ensemble. For the ensemble I used a regular voting…Dec 27, 2021Dec 27, 2021
Brenda LoznikPump it up — Which features should you include in your model?During EDA, I identified features that appeared to have great overlap with other features. Multicollinearity occurs when two or more…Dec 27, 20211Dec 27, 20211
InTDS ArchivebyCristiana de Azevedo von StoschWhy Graph-modeling Frameworks are the Future of Unsupervised LearningCo-authored by Abhishek Singh, Machine Learning Engineer at Bayer Pharmaceuticals, former Microsoft, JPMorgan Chase & Co, HSBC, and by…Apr 25, 20227Apr 25, 20227
InTDS ArchivebyAashish NairStandardization vs NormalizationDistinguishing between two common feature scaling methodsMar 21, 20221Mar 21, 20221
InTDS ArchivebyAndrew EngelNormalizing Features Within GroupsAn alternative approach to standardizationMar 22, 20221Mar 22, 20221
InTDS ArchivebyWing PoonFeature Engineering for Machine Learning (2/3)Part 2: Feature GenerationMar 14, 20221Mar 14, 20221