Open in app
Home
Notifications
Lists
Stories

Write
Karan Choudhary
Karan Choudhary

Home

About

Pinned

Market Basket Analysis Using Association Rule Mining With Apriori ECLAT and FPGROWTH Algorithm

ABOUT DATASET The dataset consists of various item which are brought by user in various transactions . The goal of the competition is to predict which products will be in a user’s next order. The dataset is anonymized and contains a sample of over7501 rows with 20 plus unique item…

Deep Learning

7 min read

Market Basket Analysis Using Association Rule Mining With Apriori ECLAT and FPGROWTH Algorithm
Market Basket Analysis Using Association Rule Mining With Apriori ECLAT and FPGROWTH Algorithm

Published in The Startup

·Pinned

BUILDING RECOMMENDED SYSTEM USING PYSPARK WITH ALS ALGORITHM

Recommender System is an information filtering tool that seeks to predict which product a user will like, and based on that, recommends a few products to the users. For example, Amazon can recommend new shopping items to buy, Netflix can recommend new movies to watch, and Google can recommend news…

Spark

5 min read

BUILDING RECOMMENDED SYSTEM USING PYSPARK WITH ALS ALGORITHM
BUILDING RECOMMENDED SYSTEM USING PYSPARK WITH ALS ALGORITHM

Published in DataDrivenInvestor

·Nov 14, 2020

Big Data using pyspark in Databricks

Implementation of linear regression through pyspark library in Databricks. before starting of the implementation we must familiar to the databricks platform. This is the homepage of the databricks developed specially for spark to implement on it and its is cost free. Before starting of the session we have to make…

Big Data

5 min read

Big Data using pyspark in Databricks
Big Data using pyspark in Databricks

Published in The Startup

·Oct 25, 2020

FEATURE SELECTION ONE OF THE MOST TIME CONSUMING TOPIC IN DATA SCIENCE PROFESSION

FEATURE SELECTION TECNIQUES 1.variance threshold →constant →quasi constant →duplicate features 2.fisher score 3.chi-square test 4.mutual information # In[1]: #importig the libararies import pandas as pd import numpy as np # In[2]: bc_data=pd.read_csv(‘breast_cancer.csv’) # In[8]: bc_data.head() # In[10]: bc_data.shape # In[5]: bc_data.columns # In[6]: bc_data.info() # In[7]: bc_data.corr() # In[11]:

Machine Learning

2 min read

FEATURE SELECTION ONE OF THE MOST TIME CONSUMING TOPIC IN DATA SCIENCE PROFESSION
FEATURE SELECTION ONE OF THE MOST TIME CONSUMING TOPIC IN DATA SCIENCE PROFESSION

Published in The Startup

·Oct 9, 2020

BIGDATA MRJOB AND PIG LATIN SCRIPTS

Create two data files with names and datasets as shown below and then implement the following queries using mrjob package in python for map reduce programming and using pigalso for the same.Compare the two processing types. We have dataset of employee and expenses in form of txt Employee.txt 101,Abhay,20000,1 102,Shiv,10000,2 …

Big Data

3 min read

BIGDATA MRJOB AND PIG LATIN SCRIPTS
BIGDATA MRJOB AND PIG LATIN SCRIPTS

Published in DataDrivenInvestor

·Oct 7, 2020

END TO END DEPLOYMENT OF HEART DISEASE PREDICTION USING FLASK (Detailed Explanation)

Importing the libraries. import numpy as np import pandas as pd import matplotlib.pyplot as plt Reading the dataset df = pd.read_csv(“heart.csv”) df.head() #printing the all the columns with first 5 rows Dataset with independent and dependent variables. age age in years Independent Variable sex (1 = male; 0 = female) …

Deep Learning

9 min read

END TO END DEPLOYMENT OF HEART DISEASE PREDICTION USING FLASK (Detailed Explanation)
END TO END DEPLOYMENT OF HEART DISEASE PREDICTION USING FLASK (Detailed Explanation)

Published in The Startup

·Sep 1, 2020

My First Kaggle notebook covering majority of algorithm and visualization of machine learning

NASA ASTEROID PREDICTION DATASET FROM KAGGLE TOPICS TO BE COVERED: 1.KNN 2.SVC 3.RANDOM FOREST CLASSIFIER 4.ADA BOOST 5.GRADIENT BOSSTING 6.XG BOOST 7.FEATURE SELECTION,CROSS VALIDATION ,HYPERPARAMETER TUNING,RUC AUC GRAPH FOR ALL THE MODELS 8.ARTIFICAL NEURAL NETWORK (DEEP LEARNING) In [ ]: #Importing library import pandas as pd import matplotlib.pyplot as plt import numpy as np import seaborn as sns

Kaggle

16 min read


Published in Analytics Vidhya

·Aug 22, 2020

All in one clustering techniques in machine learning you should know in unsupervised learning

Clustering algorithms are a powerful technique for machine learning on unsupervised data. The most common algorithms in machine learning are hierarchical clustering and K-Means clustering. These two algorithms are incredibly powerful when applied to different machine learning problems. Cluster analysis can be a powerful data-mining tool for any organization that…

Clustering

13 min read

All in one clustering techniques in machine learning you should know in unsupervised learning
All in one clustering techniques in machine learning you should know in unsupervised learning

Published in The Startup

·Aug 18, 2020

End to End Deployment of Heart Disease Prediction Through Flask With Machine Learning Algorithm

Artificial Intelligence directly translates to conceptualizing and building machines that can think and hence are independently capable of performing tasks, thus exhibiting intelligence. If this advancement in technology is a boon or a bane to humans and our surroundings is a never-ending debate. Every coin has its two faces so…

Deep Learning

7 min read

End to End Deployment of Heart Disease Prediction Through Flask With Machine learning Algorithm
End to End Deployment of Heart Disease Prediction Through Flask With Machine learning Algorithm

Published in Analytics Vidhya

·Jul 29, 2020

Introduction to Unsupervised deep learning with Self Organizing Map with its Architecture and Visualization.

Self Organizing Map(SOM) is an unsupervised neural network machine learning technique. SOM is used when the dataset has a lot of attributes because it produces a low-dimensional, most of times two-dimensional, output. The output is a discretised representation of the input space called map. How SOM works? The points in input space have…

Deep Learning

8 min read

Introduction to Unsupervised deep learning with Self Organizing Map with its Architecture and…
Introduction to Unsupervised deep learning with Self Organizing Map with its Architecture and…
Karan Choudhary

Karan Choudhary

TECH LEARNER

Following
  • Amit Chauhan

    Amit Chauhan

  • Ednalyn C. De Dios

    Ednalyn C. De Dios

  • Ahmed Besbes

    Ahmed Besbes

  • Thomas Nield

    Thomas Nield

  • Team AV

    Team AV

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Knowable