World's Best Top Ranked #2 Certificate

Data Science Architect Program

World's Best Top Ranked #2 Certificate

Data Science Architect Program

Gain 1.5 years of Research Project Experience Certificate
100% Job Guaranteed
320+ hours Live Interactive Sessions with Eminent Data Scientists
Scholarships from day one up to Rs 2 lakhs

Become an IBM Certified Data Science Architect Professional
Remote or Hybrid International internship opportunities in USA, UK, Europe & APAC for Merit Students with 2 years of Project Experience Certificate
Job Guarantee* Program in collaboration with Top PAN India Consulting Firms.
Hackathons (Cash rewards upto Rs.1 Lakh will be awarded to top performers)
320+ hours Live Interactive Sessions with Eminent Data Scientists
Opportunity to conduct research work with GEEKLURN AI Singapore

Geeklurn is now a Member of NASSCOM

GeekLurn is now a Member of NASSCOM & IBM Partner

Play Video

Gain 1.5 years of Research Project Experience Certificate
100% Job Guarantee*
320+ hours Live Interactive Sessions with Eminent Data Scientists
Scholarships from day one up to Rs 2 lakhs

Geeklurn is now a Member of NASSCOM

GeekLurn is now a Member of NASSCOM & IBM Partner

In our journey towards excellence, we are proud to become a member of NASSCOM & IBM Partner. Thankful to all our learners, team and cheerleaders who keep on inspiring us to do better each day.

Program Overview

Key Highlights

24-Months Course Duration

- 6 Months of live interactive classes with Principal/Senior industry Data Scientists
- 18 Months of Sponsored Project Work at Authorised Research Centres funded by IISC, ISB, IIM

320+ hours Live Interactive Sessions with Eminent Data Scientists.

Gain 1.5 years of real-time sponsored Project Expierence Certificate from recognised research centres

Start paying your EMI only after placements at the end of the course

100% Job Guarantee Program in collaboration with Top HR Consulting Firms across India

Scholarships from day ONE up to Rs 2 lakhs based on the type pf Research Projects

Get the opportunity to conduct research work with Singapore based GEEKLURN AI

50+ Sponsored & Funded Research Projects

Get Certified from Top Companies (IBM, Accenture, Microsoft, Oracle...)

Data Science Job Guarantee Overview

Placement Stats

Our Authorised Hiring Partners

Our Alumni

Meet Our GeekLurn-ers who are now Certified Data Science Professionals and were placed in Top MNCs by our HR Team!.

Mahadev Kadam

Data Scientist at Hexaware Technologies

Offered 29 LPA

Product Development Architect - Senior Specialist In FIS

Data Scientist

Teena Prithviraj Jain

Data Scientist at Hexaware Technologies

Adinath Solutions Logo Software Developer at Adinath Solutions

Data Scientist

Bhargav Nishanth

Data Scientist at Hexaware Technologies

Offered 5.5 LPA

Technical Support Specialist at Tech Mahindra

Data Scientist

Shashi Kiran

Data Scientist at
Tata Consultancy Services

Offered 14 LPA

Data Analyst at HCL

Machine Learning Engineer

Vamshi Krishna

Data Scientist
at Accenture

Software Test Engineer

Data Scientist

Shammer Khan

Professional 1 Product Delivery at DXC Technology

Offered 9 LPA

Data Science Trainee

Professional 1 Product Delivery

Chethan

Data Scientist at smart Data Enterprises Inc.

Offered 8 LPA

Design Engineer at Vengala Engineering Design Services (VEDS)

Data Scientist

Aathir Laeeque

Data Scientist at
Brightmoney

Offered 8 LPA

Catalog Specialist at Amazon

Data Analyst

Jay Prakash

Assistant Manager at Flipkart

Offered 30% Hike

Machine Learning Trainer at Vaidehi Software Technologies

Assistant Manager

Abhirami KV

DataScience Intern at Bayer

Offer 3.6 LPA

Fresher

Data Science Consultant

Tejasvi M

Data Scientist Trainee at ByteIQ Analytics pvt ltd

Offered 3.36 LPA

Fresher

Data Scientist Trainee

Anuj Kumar

Data Scientist Trainee at ByteIQ Analytics pvt ltd

Offered 3.36 LPA

Telecom Domain

Data Scientist Trainee

Student Reviews

International Student Reviews

I attended the 24-month Data Science Architect program at GEEKLURN and it was a fantastic experience. The curriculum is structured to help students master the essential vocabulary in Python (and R). All-in-all I felt the program delivered exactly the skills that I was looking to build. I'd recommend it highly to anybody who wants an effective foundation in Data Science and wants to Upscale.

Great place to learn Machine Learning/Data Science and related tools and technologies! Provides excellent placement support as well. I can say from my personal experience that it is the place to go for learning Data Science.

Alumni Success Meet at GeekLurn HQ

Award Felicitation to Mr. Sashi kiran by Management

Online interaction of Mr. Jayprakash, Alumni of Geeklurn working as Data Scientist @ Flipkart, Bengaluru

Felicitation by Management of Geeklurn to Mr. Sashi kiran, placed as Data Scientist @ Tata Consulting Services

Interaction of Mr. Sashi kiran with students at Alumni meet

Core Stages of Our DSAP Program

Stage 1

A conducive learning environment with 6 months of live interactive real-time training sessions with 1:1 mentorship offered by top industry experts who are working as Principal & Senior Data Scientists

Stage 2

After hands-on applied training, start working as a Data Science Intern on real-time industry assignments at an authorized GeekLurn incubation zone, which is recognised as a trusted research centre and is funded by IISC, IIM and ISB.

Stage 3

Get an opportunity to work on real- time industry-led projects officially funded by IISC, ISB and IIM. By displaying efficiency, calibre and good performance, you can be eligible for a scholarship of up to Rs. 2 lakh

Stage 4

The successful completion of the project will fetch you an Experience Certificate of 1+years issued by the respective research centre and will to help you add credible projects to your resume, which will prove to be advantageous during placements.

Stage 5

Along with the Experience Certificate, you can fine-tune your resume by getting certified as a Data Science Specialist accredited by top brands in the industry like Microsoft, IBM, HarvardX, DASCA, and SAS.

Stage 6

With 100% Job Guarantee & career support from industry experts in resume building and mock interviews practices, you will be eligible to get hired by more than 110+ Data Science Companies across India and earn up to Rs. 25 lacs/annum

Stage 7

From the 7th months after the completion of the live training session, you can start attending interviews while simultaneously working on research projects.

Data Science Architect Program Overview

GEEKLURN’s Data Science Architect Program is designed by industry experts and practitioners. It will provide you with tech-enabled job-relevant skills through the design, development and deployment of Big Data to convert it into real-time applications. The course is intended for both freshers and working professionals and will aim to positively and significantly impact their careers. Work on real-time case studies, & gain project mentorships & certifications through our 100% Job Guarantee program that, will make you job-ready and more relevant in the industry for various data science job roles.

With exposure to the key concepts and tools, towards the end of this program, you will have expertise in Testing, Analysis Modules, Hadoop Development, Administration, Statistical Computing, working with Real Analytics, analysing machine-generated data, and developing NoSQL Applications along with the mastering of Deep Learning and Artificial intelligence.

Who should take this Program

Students from NON-IT background like B.Com & MBA who wants to build a career in Business Analytics / Data Analytics / Data Science Project Management.
Any Engineering / BCA / MCA graduate who wants to build a career in Data Science.
IT and Computer Science Graduates who want to build a career in Data Science.
IT Professionals who wish to make a career shift to Data Sciences.
Data Science Professionals who wish to Upskill and become Data Architects.

What is the pre-requisite for taking up this training course?

There are no prerequisites as such to get on-board this program.

Why should you Take up This Program?

On a daily basis, over 5 billion consumers interact with data and that
number will increase to 6 billion by 2025, representing
three-quarters of the world’s population.
The Global Big Data Market is expected to reach $122 billion in revenue by
2025 – Frost & Sullivan
Research shows 94% of data science graduates
have gotten jobs in this field since 2011.

To reap out the benefits of this growing career opportunity,

embrace Data and secure your career as a Data Scientist by

getting certified with our 100% Job Guarantee Data Science Architect Program Today!.

After completion of this course, the Learner will be eligible for any of the following Job Roles:

Data Analyst
Data Architect
Business Intelligence Analyst
Data Scientist
Senior Data Scientist

Analytics Manager
Research Analyst
Data Science Architect
Machine Learning Engineer
Data Engineer

Tech Talks by Eminent Industry Experts

Mr. Havish Madhvapaty

Founder – Havish M Consulting

Mr. Priyank Ahuja

Product Manager – Accenture

Mr. Dhaval Thanki

Vice President, APAC, MEA – LogiNext

Mr. Anand Gurumurthy

CEO and Founder – Cosmitude Softwares Pvt Ltd

Mr Sujit Sukumaran

Founder & CEO – Optimus Management Consultants, Dubai, United Arab Emirates

Dr.Rajashekhar Karjagi

Manager – Analytics Solutions & POC – Accenture

Mr. Mukesh Jain

Global CTIO “Data”, Global Head of Innovation & VP Insights & Data – Capgemini

Mr. Joshua Devadas

Data Science Manager – Accenture

Mr. Nitin Bhosekar

Head Advanced Analytics, Artificial Intelligence, Automation, Big Data, DW Practice & Executive Vice President – Aress Software

Mr. Utsab Chakraborty

Senior Analytics Manager (Asst. Director) – Flipkart

Major Dr. Alexander M John

Vice President, Operations – Foundation AI

Mr. Manoj Kumar Rajendran

Principal Data Scientist – MiQ Digital India

Mr. Vinayakaram Gururajan

Data Scientist – Tata Consultancy Services

Mr Shriram Vasudevan

Project Manager – L&T Technology Service Ltd

Mr Balaji Subudhi

Senior Data Scientist – Mojro

Mr. Anish Raj

Human Resources Director

Mrs. Keru Chen

Analytics Manager – Product Grab, Singapore

Mrs. Mithila Harish

Data Scientist Micron Technology, Singapore

Mr. Ravi Parthasarathy

CEO/Director People Plus Consultants, Singapore

Advisors

A master in business analytics from Indian Institute of science, proficient in Statistics, Machine Learning, AI, Deep Learning, Quantum AI, Computer Vision, Speech, Language understanding, Knowledge Graph, Conversational computing, and IoT.

Revanth GuthalaDirector & Head for Complete Data Science Department of GEEKLURN

Mr. Adhvaith Reddy is an expert Data Scientist with more than 13 years’ worth of experience in the market research domain with industries like Genpact and Kantar. He is an expert in the subjects like Advanced Statistics, Predictive Analytics, Machine Learning, Data Pre-Processing, NLP, and Deep Learning. He also has significant expertise using tools like Excel, SPSS, R, Python, Quantum, Minitab, and Tableau. He believes that "No prior knowledge is required to join my classes.”

Mr.AdhvaithData Scientist with more than 13 years of experience in market research domain and industries like Genpact & Kantar

Thejasvi TV is a professional Data Scientist and Certified Six Sigma Black Belt (ASQ) with over 15 years of expertise. He has extensive knowledge of emerging analytics, data mining developments, and machine learning techniques/algorithms with practical expertise in R and KNIME programming languages.

Thejasvi TVSr.Data Scientist at Motorola

Udayan Goswami is a University of London Data Science expert. Python, PySpark, Hive(basic), R and SPSS, SQL, Spark, and Retail Banking are just some of his skills. He has a solid statistical understanding and a wealth of experience in constructing predictive and statistical models.

Udayan Goswami Sr.Data Scientist at Verizon

Murthy Adivi holds an MBA with 8.5 years of expertise in reporting, analytics, people management, operational and functional excellence, business intelligence, project management, and marketing research. He is a Subject Matter Expert with experience in Statistical Tools (SAS, R, and Minitab), SQL, Vertica and primary research (NPS).

Murthy Adivi Data Science Trainer at Dell EMC

Advisors

Revanth GuthalaDirector & Head for Complete Data Science Department of GEEKLURN

Mr.AdhvaithData Scientist with more than 13 years of experience in market research domain and industries like Genpact & Kantar

Thejasvi TVSr.Data Scientist at Motorola

Udayan Goswami Sr.Data Scientist at Verizon

Murthy Adivi Data Science Trainer at Dell EMC

Main Syllabus

Download Brochure

Introduction to Data Science

Learning Objectives: Business Intelligence vs Data Analysis vs Data Scientist

Data Scientist Roles

Different Disciplines of Data Science:

Machine Learning
Natural Language Processing
Deep Learning

Applications of Machine Learning

Why Machine Learning is the Future

Creating the “Hello World” code
Demonstrating Conditional Statements
Variables
Demonstrating Loops

What are Prerequisites for Data Science?

Statistics
Python Essentials for Data Science

Statistics & Probability

Descriptive Statistics and Inferential Statistics
Sample and Population
Variables and Data types
Percentiles
Measures of Central Tendency
Measures of Spread
Skewness, Kurtosis
Degrees of freedom
Variance, Covariance, Correlation
Descriptive statistics and Inferential Statistics in Python
Test of Hypothesis
Confidence Interval
Sampling Distribution
Standard Probability Distribution Functions
Bernoulli, Binomial-Distributions
Normal Distributions

EDA and Data Visualization

Data Transformations
Outlier Detection and Management
Charts and Graphs
One Dimensional Chart
Box plots
Bar graph
Histogram
Scatter plots
Multi-Dimensional Charts

Introduction to Python

Learning Objectives: You will get a brief idea of what Python is and touch on the basics.

Topics:

Overview of Python
Diﬀerent Applications where Python is used
Values, Types, Variables
Conditional Statements
Command Line Arguments
The Companies using Python
Discuss Python Scripts on UNIX/Windows
Operands and Expressions
Loops
Writing to the screen

Hands-On/Demo:

Creating the “Hello World” code
Demonstrating Conditional Statements
Variables
Demonstrating Loops

Skills:

Fundamentals of Python Programming

Deep Dive – Functions, OOPs, Modules, Errors, and Exceptions

Learning Objectives: In this module, you will learn how to create generic Python scripts, how to address errors/exceptions in code and ﬁnally how to extract/ﬁlter content using regex.

Topics:

Functions
Global Variables
Lambda Functions
Standard Libraries
The Import Statements
Package Installation Ways
Handling Multiple Exceptions
Function Parameters
Variable Scope and Returning Values
Object-Oriented Concepts
Modules Used in Python
Module Search Path
Errors and Exception Handling

Hands-On/Demo:

Functions – Syntax, Arguments, Keyword Arguments, Return Values
Sorting – Sequences, Dictionaries, Limitations of Sorting
Packages and Module – Modules, Import Options, Sys Path
Lambda – Features, Syntax, Options, Compared with the Functions
Errors and Exceptions – Types of Issues, Remediation

Skills:

Error and Exception Management in Python
Working with Functions in Python

Data Manipulation

Learning Objectives: Through this module, you will understand in-detail about Data Manipulation

Topics:

Basic Functionalities of a data object
Concatenation of Data objects
Exploring a Dataset
Merging of Data Objects
Types of Joins on data Objects
Analysing a Dataset

Hands-On/Demo:

Pandas Function- Ndim(), axes(), values(), head(), tail(), sum(), std(), iteritems(), iterrows(), itertuples()
Aggregation
Merging
GroupBy operations
Concatenation
Joining

Skills:

Python in Data Manipulation

Introduction to Machine Learning with Python

Learning Objectives: In this module, you will learn the concept of Machine Learning and its types.

Topics:

Python Revision (Numpy, Pandas, scikit learn, matplotlib)
Machine Learning Use-Cases
Machine Learning Categories
Gradient Descent
What is Machine Learning?
Machine Learning Process Flow
Linear Regression

Hands-On/Demo:

Linear Regression – Boston Dataset

Skills:

Machine Learning Concepts
Linear Regression Implementation
Machine Learning Types

Supervised Learning - I

Learning Objectives: In this module, you will explore Supervised Learning Techniques and their implementation, for example, Decision Trees, Random Forest Classiﬁer, etc.

Topics:

What are Classiﬁcation and its use cases?
Algorithm for Decision Tree Induction
Confusion Matrix
What is Decision Tree?
Creating a Perfect Decision Tree
What is a Random Forest?

Hands-On/Demo:

Implementation of Logistic Regression
Random Forest
Decision Tree

Skills:

Supervised Learning Concepts
Evaluating Model Output
Implementing diﬀerent types of Supervised Learning Algorithms

Dimensionality Reduction

Learning Objectives: In this module, you will learn about the impact of dimensions within data. You will be taught to perform factor analysis using PCA and compress dimensions. Also, you will be developing the LDA model.

Topics:

Introduction to Dimensionality
PCA
Scaling Dimensional Model
Why Dimensionality Reduction
Factor Analysis
LDA

Hands-On/Demo:

PCA
Scaling

Skills:

Implementing Dimensionality Reduction Technique

Supervised Learning - II

Learning Objectives: In this module, you will learn Supervised Learning Techniques and their implementation,

Topics:

What is Naïve Bayes?
Implementing Naïve Bayes Classiﬁer
Illustrate how Support Vector Machine works?
Grid Search vs Random Search
How Naïve Bayes works?
What is Support Vector Machine?
Hyperparameter Optimization
Implementation of Support Vector Machine for Classiﬁcation

Hands-On/Demo:

Implementation of Naïve Bayes, SVM

Skills:

Supervised Learning Concepts
Evaluating Model Output
Implementing diﬀerent types of Supervised Learning Algorithms

Unsupervised Learning

Learning Objectives: In this module, you will learn about Unsupervised Learning and the various types of clustering that can be used to analyse the data.

Topics:

What is Clustering & its Use Cases?
How does the K-means algorithm work?
What is C-means Clustering?
How does Hierarchical Clustering works?
What is K-means Clustering?
How to do Optimal clustering
What is Hierarchical Clustering?

Hands-On/Demo:

Implementing K-means Clustering
Implementing Hierarchical Clustering

Skills:

Unsupervised Learning
Implementation of Clustering – Various Types

Association Rules Mining and Recommendation Systems

Learning Objectives: In this module, you will learn Association rules and their extension towards recommendation engines with the Apriori Algorithm.

Topics:

What are Association Rules?
Calculating Association Rule Parameters
How do Recommendation Engines work?
Content-Based Filtering
Association Rule Parameters
Recommendation Engines
Collaborative Filtering

Hands-On/Demo:

Apriori Algorithm
Market Basket Analysis

Skills:

Data Mining using Python
Recommender Systems using Python

Reinforcement Learning

Learning Objectives: In this module, you will learn about Unsupervised Learning and the various types of clustering that can be used to analyse the data.

Topics:

What is Reinforcement Learning?
Elements of Reinforcement Learning
Epsilon Greedy Algorithm
Q values and V values
α values
Why Reinforcement Learning
Exploration vs Exploitation dilemma
Markov Decision Process (MDP)
Q – Learning

Hands-On/Demo:

Calculating Reward
Calculating Optimal quantities
Setting up an Optimal Action
Discounted Reward
Implementing Q – Learning

Skills:

Implement Reinforcement Learning using Python
Developing Q – Learning model in Python

Time Series Analysis

Learning Objectives: In this module, you will learn about Time Series Analysis to forecast dependent variables based on time. You will be taught diﬀerent models for Time Series Modeling such that you analyse a real time-dependent data for forecasting.

Topics:

What is Time Series Analysis?
Components of TSA
AR Model
ARMA Model
Stationarity
Importance of TSA
White Noise
MA Model
ARIMA Model
ACF & PACF

Hands-on/demo:

Checking Stationarity
Implementing the Dickey-Fuller Test
Generating the ARIMA plot
Converting a non-stationary data to stationary
Plot ACF and PACF
TSA Forecasting

Skills:

TSA in Python

Model Selection and Boosting

Learning Objectives: In this module, you will learn about selecting one model over another. Also, you will learn about Boosting and its importance in Machine Learning. You will learn on how to convert weaker algorithms into stronger ones.

Topics:

What is the Model Selection?
Cross-Validation
How Boosting Algorithms work?
Adaptive Boosting
The need for Model Selection
What is Boosting?
Types of Boosting Algorithms

Hands-On/Demo:

Cross-Validation
AdaBoost

Skills:

Model Selection
Boosting algorithm using Python

Sequences and File Operations

Learning Objectives: Learn diﬀerent types of sequence structures, related operations and their usage. Also learn diverse ways of opening, reading, and writing to ﬁles.

Topics:

Python ﬁles I/O Functions
Strings and Related Operations
Lists and Related Operations
Sets and Related Operations
Numbers
Tuples and related operations
Dictionaries and related operations

Hands-On/Demo:

Tuple – properties, Related Operations, compared with a list
Dictionary – properties, related operations
List – properties, related operations
Set – properties, Related Operations

Skills:

File Operations using Python
Working with data types of Python

Introduction to NumPy, Pandas and Matplotlib

Learning Objectives: This module helps you get familiar with the basics of statistics, diﬀerent types of measures and probability distributions, and the supporting libraries in Python that assist in these operations. Also, you will learn in detail about data visualisation.

Topics:

NumPy – Arrays
Indexing Slicing and iterating
Pandas – Data Structures & Index Operations
Matplotlib Library
Markers, Colours, Fonts and Styling
Contour Plots
Operations on Arrays
Reading and Writing Arrays on Files
Reading and Writing Data from Excel/CSV formats into Pandas
Grids, Axes, Plots
Types of Plots – Bar Graphs, Pie Charts, Histograms

Hands-On/Demo:

NumPy Library- Creating NumPy array, operations performed on NumPy array
Matplotlib – Using Scatterplot, histogram, bar graph, a pie chart to show information, Styling of Plot
Pandas Library- Creating series and data frames, Importing and exporting data

Skills:

Probability Distributions in Python
Python for Data Visualisation

Which Case Studies will be a part of this Python Certiﬁcation Course ?

Case Study 1: Maple Leaves Ltd is a start-up company that makes herbs from diﬀerent types of plants and leaves. Currently, the system they use to classify the trees that they import in a batch is quite manual. A labourer from his experience decides the leaf type and subtype of the plant family. They have asked us to automate this process and remove any manual intervention from this process.
Case Study 2: BookRent is the largest online and oﬄine book rental chain in India. The company charges a ﬁxed fee per month plus rental per book. So, the company makes more money when the user rents more books. You are an ML expert and must model a recommendation engine so that the user gets a recommendation of books based on the behaviour of similar users. This will ensure that users are renting books based on their individual tastes. The company is still unproﬁtable and is looking to improve both revenue and proﬁt. Compare the Error using two approaches – User-Based vs Item Based. You have to classify the plant leaves by various classiﬁers from diﬀerent metrics of the leaves and to choose the best classiﬁer for future reference.
Case Study 3: Handle missing values and ﬁt a decision tree and compare its accuracy with a random forest classiﬁer. Predict the survival of a horse based on various observed medical conditions. Load the data from ‘horses.csv’ and observe whether it contains missing values. Replace the missing values by the most frequent value in each column. Fit a decision tree classiﬁer and observe the accuracy. Fit a random forest classiﬁer and observe the accuracy.
Case Study 4: Principal component analysis using scikit learn. Load the digits dataset from sklearn and write a helper function to plot the image. Fit a logistic regression model and observe the accuracy. Using scikit learn to perform a PCA transformation such that the transformed dataset can explain 95% of the variance in the original dataset. Compare it with a model and also comment on the accuracy. Compute the confusion matrix and count the number of instances that have gone wrong. For each of the wrong sample, plot the digit along with the predicted and original label.
Case Study 5: Handling GIS data and working with maps. Creating, cleaning, collating and visualizing maps of India at different levels – state, district, taluka, and villages. Using Geo Pandas, Mapviz, and Leaflet in Python to perform spatial analytics and visualizing statistics with geographical context. Using public data of government expenditure, identify the areas and districts with the highest expenditure per capita in different states and all over India.

Which kind of projects will be a part of this Python Certiﬁcation Course?

Project #1: Industry: Social Media

Problem Statement: You, as ML expert, have to do analysis and modeling to predict the number of shares of an article given the input parameters.
Actions to be performed: Load the corresponding dataset. Perform data wrangling, visualisation of the data and detect the outliers, if any. Use the plotly library in Python to draw useful insights out of data. Perform regression modeling on the dataset as well as decision tree regressor to achieve your Learning Objectives. Also, use scaling processes, PCA along with boosting techniques to optimise your model to the fullest.

Project #2: Industry: FMCG

Problem Statement: You as an ML expert have to cluster the countries based on various sales data provided to you across years.
Actions to be performed: You have to apply an unsupervised learning technique like K means or Hierarchical clustering so as to get the ﬁnal solution. But before that, you have to bring the exports (in tons) of all countries down to the same scale across the years. Plus, as this solution needs to be repeatable you will have to do PCA so as to get the principal components that explain the max variance.

Natural Language Processing with deep Learning in Python

WHAT YOU WILL LEARN

Understand and implement word2vec
Understand the CBOW method in word2vec
Understand the skip-gram method in word2vec
Understand the negative sampling optimisation in word2vec
Understand and implement GloVe using gradient descent and alternating least squares
Use recurrent neural networks for parts-of-speech tagging
Use recurrent neural networks for named entity recognition
Understand and implement recursive neural networks for sentiment analysis
Understand and implement recursive neural tensor networks for sentiment analysis

Natural Language Processing in TensorFlow

WHAT YOU WILL LEARN

Build natural language processing systems using TensorFlow
Process text, including tokenisation and representing sentences as vectors
Apply RNNs, GRUs, and LSTMs in TensorFlow
Train LSTMs on existing text to create original poetry and more

Module 1 : Introduction to Natural Language Processing

Getting Started
Knowing each other
Welcome to the Course
About the Course
Introduction to Natural Language Processing
Exercise: Introduction to Natural Language Processing
Podcast with NLP Researcher Sebastian Ruder

Module 2 : A Refresher to Python

Installation steps for Linux
Installation steps for Mac
Installation steps for Windows
Packages Installation
Introduction to Python
Variables and Operators
Exercise: Variables and Operators
Python Lists
Exercise: Python Lists
Dictionaries
Exercise: Dictionaries
Conditional Statements
Exercise: Conditional Statements
Loops
Exercise: Loops
Functions
Python Functions Practice
Exercise: Functions
Packages
Exercise: Packages
Files
Exercise: Files

Module 3 : Learn to use Regular Expressions

Welcome to Module
Understanding Regular Expression
Implementing Regular Expression in Python
Exercise: Implementing Regular Expression in Python
Regular Expressions in Action

Module 4 : First Step of NLP - Text Processing

Welcome to Module
Tokenization and Text Normalisation
Exercise: Tokenisation and Text Normalisation
Exploring Text Data
Part of Speech Tagging and Grammar Parsing
Exercise: Part of Speech Tagging and Grammar Parsing
Implementing Text Pre-processing Using NLTK
Exercise: Implementing Text Pre-processing Using NLTK
Natural Language Processing Techniques using spaCy

Module 5 : Extracting Named Entities from Text

Welcome to Module
Understanding Named Entity Recognition
Exercise: Understanding Named Entity Recognition
Implementing Named Entity Recognition
Exercise: Implementing Named Entity Recognition
Named Entity Recognition and POS tagging using spaCy
POS and NER in Action: Text Data Augmentation
Assignment: Share your learning and build your profile

Module 6 : Feature Engineering for Text

Introduction to Text Feature Engineering
Count Vector, TFIDF Representations of Text
Exercise: Introduction to Text Feature Engineering
Understanding Vector Representation of Text
Exercise: Understanding Vector Representation of Text
Understanding Word Embeddings
Word Embeddings in Action – Word2Vec
Word Embeddings in Action – GloVe

Module 7 : Mastering the Art of Text Cleaning

Introduction to Text Cleaning Techniques Part 1
Exercise: Introduction to Text Cleaning Techniques Part 1
Introduction to Text Cleaning Techniques Part 2
Exercise: Introduction to Text Cleaning Techniques Part 2
Text Cleaning Implementation
Exercise: Text Cleaning Implementation
NLP Techniques using spaCy

Project I – Social Media Information Extraction

Module 9 : Interpreting Patterns from Text - Topic Modelling

Introduction to Topic Modelling
Exercise: Introduction to Topic Modelling
Understanding LDA
Exercise: Understanding LDA
Implementation of Topic Modelling
Exercise: Implementation of Topic Modelling
LSA for Topic Modelling

Module 10: Project II - Categorisation of Sports Articles

Understanding the Problem Statement
Importing Dataset
Text Cleaning and Pre-processing
Categorising Articles using Topic Modelling

Module 11.1 : Machine Learning Algorithms

Types of Machine Learning Algorithms
Logistic Regression
Decision Tree
Naive Bayes
SVM (Support Vector Machine)
Random Forest

Module 11.2 : Understanding Text Classification

Overview of Text Classification
Exercise: Overview of Text Classification
Assignment: Share your learning and build your profile

Module 12.1 : Introduction to Deep Learning (Optional)

Getting started with Neural Network
Exercise: Getting started with Neural Network
Understanding Forward Propagation
Exercise: Forward Propagation
Math Behind forwarding Propagation
Exercise: Math Behind forwarding Propagation
Error and Reason for Error
Exercise: Error and Reason for Error
Gradient Descent Intuition
Understanding Math Behind Gradient Descent
Exercise: Gradient Descent
Optimiser
Exercise: Optimiser
Back Propagation
Exercise: Back Propagation
Why Keras?
Exercise: Why Keras?
Building a Neural Network for Text Classification
Why CNN?
Exercise: Why CNN?
Understanding the working of CNN Filters
Exercise: Understanding the working of CNN Filters
Introduction to Padding
Exercise: Introduction to Padding
Padding Strategies
Exercise: Padding Strategies
Padding Strategies in Keras
Exercise: Padding Strategies in Keras
Introduction to Pooling
Exercise: Introduction to Pooling
CNN architecture and it’s working
Exercise: CNN architecture and it’s working

Module 12.2 : Deep Learning for NLP

Deep Learning for NLP Part 1
Exercise: Deep Learning for NLP Part 1
Deep Learning for NLP Part 2
Exercise: Deep Learning for NLP Part 2
Text Generation Using LSTM
Exercise : Text Generation Using LSTM

Module 13 : Project III – SMS Spam Classification

Dataset download
Text Cleaning
Feature Engineering
Advanced Feature Engineering
Combining Features
ML Classifier
Spam Classification using Deep Learning

Module 14 : Project IV – Hate Speech Classification

Project III

Module 15 : Project V – Building Auto-Tagging System

Overview of Auto-Tagging System
Introduction to Dataset and Performance Metrics
Auto-Tagging Implementation Using Machine Learning Part-1
Auto-Tagging Implementation Using Machine Learning Part-2
Auto-Tagging Implementation Using Deep Learning

Module 16 : Recurrent Neural Networks

Why RNN
Introduction to RNN: Shortcomings of an MLP
Introduction to RNN: RNN Architecture
Training an RNN: Forward propagation
Training an RNN: Backpropagation through time
Need for LSTM/GRU
Long Short Term Memory (LSTM)
Gated Recurrent Unit (GRU)
Project: Categorisation of websites using LSTM and GRU I
Dataset and Notebook
Project: Categorisation of websites using LSTM and GRU II

Module 17 : Introduction to Language Modelling in NLP

Overview: Language Modelling
What is a Language Model in NLP?
N-gram Language Model
Implementing an N-gram Language Model – I
Implementing an N-gram Language Model – II
Neural Language Model
Implementing a Neural Language Model

Module 18 : Sequence-to-Sequence Modelling

Intuition Behind Sequence-to-Sequence Modelling
Need for Sequence-to-Sequence Modelling
Understanding the Architecture of Sequence-to-Sequence
Understanding the Functioning of Encoder and Decoder
Case Study: Building a Spanish to English Machine Translation Model
Pre-processing of Text Data
Converting Text to Integer Sequences
Model Building and Inference

Module 19 : Project VI - Summarisation of Customer Reviews

Introduction
Pre-processing and Feature Creation
Model Building and Summary Genera

Module 20 : Project VII - Build your first Chatbot

Introduction
About this module
Overview of Conversational Agents
Project – Foodbot
Overview of Rasa Framework
System Setup
Rasa NLU: Understanding user intent from a message
Rasa NLU: Extracting intents from a user’s message
Rasa Core: Making your chatbot conversational
Working with Zomato API
Create a Workspace in Slack
Deploying to Slack
Assignment: Share your learning and build your profile

Module 21 : Bonus Section (Advance NLP tools)

Getting started with Bonus Section
Text Classification & Word Representations using FastText (An NLP library by Facebook)
Introduction to Flair for NLP: A Simple yet Powerful State-of-the-Art NLP Library
Introduction to Stanford NLP: An Incredible State-of-the-Art NLP Library for 53 Languages (with Python code)
A Step-by-Step NLP Guide to Learn Elmo for Extracting Features from Text
Tutorial on Text Classification (NLP) using ULMFiT and fastai Library in Python
8 Excellent Pretrained Models to get you started with Natural Language Processing (NLP)
Geo-coding using NLP by Shantanu Bhattacharyya and Farhat Habib
Demystifying the What, the Why and How of Chatbot by Sonny Laskar
Sentiment Analysis using NLP and Deep Learning by Jeeban Swain
Identifying Location using Clustering and Language Model – By Divya Choudhary
Building Intelligent Chatbots from Scratch

Project : VIII Stock Prices Predictor

This is another interesting machine learning project idea for data scientists/machine learning engineers working or planning to work with the finance domain. Stock prices predictor is a system that learns about the performance of a company and predicts future stock prices. The challenges associated with working with stock price data are that it is very granular, and moreover there are different types of data like volatility indices, prices, global macroeconomic indicators, fundamental indicators, and more. One good thing about working with stock market data is that the financial markets have shorter feedback cycles making it easier for data experts to validate their predictions on new data. To begin working with stock market data, you can pick up a simple machine learning problem like predicting 6-month price movements based on fundamental indicators from an organisations’ quarterly report. You can download Stock Market datasets from Quandl.com or Quantopian.com.

Project : IX Human Activity Recognition using Smartphone Dataset

The smartphone dataset consists of fitness activity recordings of 30 people captured through smartphone-enabled with inertial sensors. The goal of this machine learning project is to build a classification model that can precisely identify human fitness activities. Working on this machine learning project will help you understand how to solve multi-classification problems. One can become a master of machine learning only with lots of practice and experimentation. Having theoretical surely helps but it’s the application that helps progress the most. No amount of theoretical knowledge can replace hands-on practice. There are many other machine learning projects for beginners like the ones mentioned above that you can work with. However, it will help if you familiarise yourself with the above-listed projects first. If you are a beginner and new to machine learning then working on machine learning projects designed by industry experts at DeZyre will make some of the best investments of your time. These machine learning projects have been designed for beginners to help them enhance their applied machine learning skills quickly whilst giving them a chance to explore interesting business use cases across various domains – Retail, Finance, Insurance, Manufacturing, and more. So, if you want to enjoy learning machine learning, stay motivated, and make quick progress then DeZyre’s machine learning interesting projects are for you. Plus, add these machine learning projects to your portfolio and land a top gig with a higher salary and rewarding perks.

Project : X Learn to build Recommender Systems with Movielens Dataset

From Netflix to Hulu, the need to build an efficient movie recommender system has gained importance over time with increasing demand from modern consumers for customised content. One of the most popular datasets available on the web for beginners to learn how to build recommender systems is the Movielens Dataset which contains approximately 1,000,209 movie ratings of 3,900 movies made by 6,040 Movielens users. You can get started working with this dataset by building a world-cloud visualisation of movie titles to build a movie recommender system.

This course comprises of 40 case studies that will enrich your learning experience. In addition, we also have 4 Projects that will enhance your implementation skills. Below are a few case studies, which are part of this course:

Case Study 1: Maple Leaves Ltd is a start-up company that makes herbs from diﬀerent types of plants and leaves. Currently, the system they use to classify the trees which they import in a batch is quite manual. A laborer from his experience decides the leaf type and subtype of the plant family. They have asked us to automate this process and remove any manual intervention from this process.

Project : XII AutoBot

Build a system that can have a conversation with you. The user types messages and your system replies based on the user’s text. Many approaches here … you could use a large twitter corpus and do language similarity.

Cloud Computing & Deployment of Machine Learning Modals to Cloud

Flask basics
Deployment of the model on Heroku
AWS basics
- S3
- EC2
- AWS Lambda
Deployment of the model on EC2
Deployment on AWS Lambda (Optional)
Google Cloud Platform Basics
Deployment of the Model on GCP
Microsoft Azure basics
Deployment of the Model on Azure
Pyspark Basics
DeVops Concepts

Business & Data Analytics

Data Visualization with Tableau & Power BI
SQL
MSOFFICE
Introduction to Operational , HR, Finance, Marketing Analytics

Data Science Project Management

Leading Data Science Teams & Processes
Exploring Methodologies
How to manage data science projects and lead a data science team
Agile Data Science
Scrum Data Science
Emerging Approaches – Microsoft TDSP
Data Science Methodology understanding
Business & Data understanding
Modelling & Evaluation
Plan Deployment
Data Science Project Report

Big Data , Spark & Kafka , MongoDB, CICD

Introduction to Bigdata & HDFS along with Linux concepts , its importance in data science
Core Components of Hadoop
HDFS Architecture
HDFS Commands

MapReduce Concepts & programming

Apache Scoop Fundamentals & basics
Apache Spark fundamentals & advanced concepts and its importance in data science
Introduction to Kafka
Bigdata on Cloud and its importance in Data Science

Addon Syllabus

Hello, world!

Program Highlights

24 Months Program Duration
6 Months of live Interactive Classes by Principal or Senior Data Scientists across Industry
18 months of Sponsored Research Project Work at GEEKLURN AI
Hackathons (Cash rewards upto Rs.1 Lakh will be awarded to top performers)
Remote or Hybrid International internship opportunities in USA, UK, Europe & APAC for Merit Students with 2 years of Project Experience Certificate
Job Guarantee* Program In collaboration with Top HR Consulting Firms across PAN INDIA

Our Data Science & AI Community

Trusted by 100+ Data Science Experts
Tech Talks, Webinars from Data Science Heads @ Forbes Technology Council & Reputed MNC’s
100+ Data Science Meetups
40+ powerful tools & resource to achieve Data Science goals.
Innovative AI & Big Data conferences, Forums led by acknowledged speakers and proficient analysts.

Real-Time Projects

Gain upto 2 years of Project Experience Certificate working on International Client (USA, Europe and APAC) Projects from Geeklurn AI

Project Certificate

Data Science Architect Project Certification from Top MNCs

Complete the designated project to acquire this prestigious certification from reputed research centres.

Program Certificates

Course Features

Tools Covered

Program Fees

₹ 2,78,000

Inclusive of 18% GST.

24-Months Program Duration
- 6 Months of live interactive classes
- 18 Months of Sponsored Project Work
320+ Hours Live Training Sessions
Gain 1.5 years of real-time Project Experience Certificate
Scholarships from day one up to Rs 2 lakhs
50+ Sponsored Funded Research Projects
Get Certified from Top Companies
100% Job Guarantee Program

Apply Now

First Name

Last Name

Phone

Accept to Contact

I authorize GeekLurn and its representative to contact me with updates and notifications via Email, SMS, WhatsApp, and Call. This will override the registry on DND/NDNC. By submitting this form, you agree to our Terms and Conditions.

Flexible Payment options Available from Finance Partners.

We proudly partner with the safe, secure and best financial services in the industry.

Frequently Asked Questions

Is there any Data Science course with a placement guarantee?

At GeekLurn, our Data Science Architect course has unique modules crafted in line with the latest industry trends. Being a member of NASSCOM & and an IBM partner, our corporate connect ensures our students are placed in top consulting firms among others.

What kind of jobs are available for data science course freshers?

With a robust course that has its modules crafted as per latest industry trends, a wide range of opportunities open up for freshers. Research data reveals that India recruits a high number of employees in the field of data science, second only to the United States. The demand for data experts span a wide range of industries including IT, Finance, Retail, E-commerce, Consulting & more. At GeekLurn we have a strong industry connect and we offer 100 percent job placements in collaboration with the top consulting firms to our fresh graduates.

What kind of job can I get after the Data Science program?

Data Science opens up a wide range of opportunities across multiple industries all of which need to mine data into actionable insights for business growth. Whether it is Information Technology, Banking & Finance, Retail or Consulting, the demand for data science professionals is on the rise. After a robust data science course that is crafted in line with the latest industry trends you can look at building a career as a Business Analyst, Data Architect, Machine Learning Engineer and more.

Business and financial analytics
Bioinformatics
Health & pharma
Robotics & AI
Chip design
Cloud computing
Marketing analysis
Agriculture
Gaming
Education

Which is the best Data Science course with job guarantee?

At GeekLurn, we offer extremely robust courses that are in line with the latest industry trends and that keep you ahead of the curve. The courses come with the opportunity to conduct on ground research and also offer the opportunity to interact with eminent data scientists. With our strong industry connect we are able to offer 100 percent placement in collaboration with the top consulting firms across India.

Which course is best for a Data Scientist job?

The Data Science Architect Program at GeekLurn comes with robust modules that are in line with the latest industry trends and give you a definite edge. In addition the course offers you the opportunity to conduct on ground research and also interact with eminent data scientists. With GeekLurn’s strong industry connect, its membership with NASSCOM & IBM partnership, we offer 100 percent placement to our graduates.

Students are also provided career guidance and mentoring by industry professionals to face job interviews that GeekLurn provides through 400+ hiring partners and top HR companies in the industry.

🎉 🎉 GEEKLURN has partnered with OPPTY.AI, Singapore to provide international opportunities to students. Learn More 🎉 🎉

Data Science Architect Program

Data Science Architect Program

Geeklurn is now a Member of NASSCOM

GeekLurn is now a Member of NASSCOM & IBM Partner

Geeklurn is now a Member of NASSCOM

GeekLurn is now a Member of NASSCOM & IBM Partner

2200+

Online

24 Months

110+

EMI Options

GeekLurn is now a Member of NASSCOM & IBM Partner

Program Overview

Key Highlights

24-Months Course Duration

320+ hours Live Interactive Sessions with Eminent Data Scientists.

Gain 1.5 years of real-time sponsored Project Expierence Certificate from recognised research centres

Start paying your EMI only after placements at the end of the course

100% Job Guarantee Program in collaboration with Top HR Consulting Firms across India

Scholarships from day ONE up to Rs 2 lakhs based on the type pf Research Projects

Get the opportunity to conduct research work with Singapore based GEEKLURN AI

50+ Sponsored & Funded Research Projects

Get Certified from Top Companies (IBM, Accenture, Microsoft, Oracle...)

Data Science Job Guarantee Overview

Job Guarantee*

4 million+ Data Jobs

1-on-1 Career Mentorship

Easy Financing Options

Placement Stats

30% Average Salary Hike

16 LPA Average Salary

8000 + Jobs Sourced

500+ Hiring Partners

Our Authorised Hiring Partners

Our Alumni

Mahadev Kadam

Data Scientist at Hexaware Technologies

Offered 29 LPA

Product Development Architect - Senior Specialist In FIS

Data Scientist

Teena Prithviraj Jain

Data Scientist at Hexaware Technologies

Offered 8 LPA

Adinath Solutions Logo Software Developer at Adinath Solutions

Data Scientist

Bhargav Nishanth

Data Scientist at Hexaware Technologies

Offered 5.5 LPA

Technical Support Specialist at Tech Mahindra

Data Scientist

Shashi Kiran

Data Scientist at Tata Consultancy Services

Offered 14 LPA

Data Analyst at HCL

Machine Learning Engineer

Vamshi Krishna

Data Scientist at Accenture

Offered 18 LPA

Software Test Engineer

Data Scientist

Shammer Khan

Professional 1 Product Delivery at DXC Technology

Offered 9 LPA

Data Science Trainee

Professional 1 Product Delivery

Chethan

Data Scientist at smart Data Enterprises Inc.

Offered 8 LPA

Design Engineer at Vengala Engineering Design Services (VEDS)

Data Scientist

Aathir Laeeque

Data Scientist at Brightmoney

Offered 8 LPA

Catalog Specialist at Amazon

Data Analyst

Jay Prakash

Assistant Manager at Flipkart

Offered 30% Hike

Machine Learning Trainer at Vaidehi Software Technologies

Data Scientist at
Tata Consultancy Services

Data Scientist
at Accenture

Data Scientist at
Brightmoney