Full Stack Data Science Program

PROGRAM OVERVIEW

Full stack program is a unique offering from Emerging India. This gives an end-to-end understanding of datascience covering data analytics, Machine Learning, visualization, soft skills and much more. To add flavor to the course content this also includes an extensive module on SAS + SQL

 

Quick Contact




CURRICULUM

MODULE 1 : Data Science with R, Python & Machine Learning

SQL

Basics of Database, Types of Databases, DDL, DML, Select, Insert, Update, Delete
Joins, nth highest
Assessments:- Assignment of SQL

R Programming 

 Vector, Matrix, Array, List, DataFrame, Factors
File Handling (upload, download), connecting with database
Functions (Custom/user defined)
Apply Family, Graphs with R
Assessments:- Assignment of R

Python

IDE of Python, Basics of Python, Control Structure, String, Number
List, Dictionary, Sets, Tuples, file handling, functions
Numpy, regular expressions
Pandas, Graphs( Matplotlib), init
Assessments:- Assessment of Python

Statistics 

Descriptive Analytics (mean, median, mode, AM, GM, HM, Quartile, Percentile, Decile, Box plot, range, MD, SD, Variance)
Covariance, Correlation, skewness, kurtosis
Hypothesis Testing (p value)( T-test, Z-test, F-test)
Regression Analysis( simple, mutiple, linear, non linear)
Assessments:- Assessment of Statistics

ML 

Classification( training, test, ROC, confusion matrix),KNN, Decision Tree
Probability, Bayes Theorem, Naive Bayes classifier
ANN, CNN, RNN
Clustering(K-Means), Association Mining
Time Series Analysis( differencing, MA, AR, ARMA, ARIMA, exponential smoothing)
Assessments:- Assessment of ML

Excel  

VBA, Graphs

Excel 

Formulas in Excel
Assessments:- Excel Assignment

Documents  

Word, Microsoft powerpoint, Microsoft Vision
Assessments:- Assessment of Word Doccuments

Assignment

10 Assignment;One Each Topics

Test Series  

2 Full Test

MODULE 2 : Big Data with Hadoop and Spark

Hadoop 

HDFS Architecture( HDFS, YARN, MapReduce, namenode, datanode)/Cloudera
HDFS Commands
HIVE Architecture
Hive Query Language ( DDL, DML, joins, map, dictionary)
HBASE (Hbase architecture)
PIG
Assessments:- Assessment of Hadoop
Assessment of Hive

Spark   

Flume/Sqoop/ Oozie
Spark Architecture
Spark Streaming
SparkSQL/Pyspark/SparkR
Assessments:- Assessment of Spark
Assessment of Flume & Sqoop

Assignment   

04 Assignment;One Each Topics

Test Series   

01 Full Test

MODULE 3 : Data Visulaization with Tableau

Tableau 

Tableau Desktop, Tableau products, Dimensions, Measures, Filters, Marks, Dual Axis,
LOD expressions, Storyline, maps, market maps, Tableau preference scripts, connecting to different data sources
Dashboard Designing, Dashboard Actions, Extract, Live connection
Assessments:-  Tableau Assessment

Assignment 

02 Assignment;One Each Topics

Test Series  

01 Full Test

MODULE 4 : Soft Skills Training

Soft Skills  

Understands the Verticals-Engineering, Financial & Others
Manage your work to meet requirements
Work Effectively with Colleagues
Maintain Healthy, Safe & Secure Working Environment
Provide Data/Information in Standard Formats
Working with documents
Develop Knowledge, Skill & Competence
Assessments:-  Assessment-01
Assessment-02
Assessment-03
Assessment-04

Assignment  

04 Assignment;One Each Topics

Test Series 

02 Full Test

MODULE 5 : Data Analytics with SAS & SQL

SAS-Data Structures  

SAS data sets, libraries, combining data sets, accessing data from external sources like Excel workbook, manipulating SAS values, exporting data to create raw files, processing observations and variables.
Assessments:-  Assessment of SAS

SAS-Managing Data 

Sorting, conditional execution, assignment statements, modifying variable attributes, totalling and sub totalling, functions (numeric, character and dates), coercion, loops, data validation and data wrangling.
Assessments:-  Assessment of SAS

SAS-Reports  

Using procedures like PRINT to generate list reports, summary reports, frequency tables, report enhancement, user-defined formats, titles, footnotes and SAS system reporting, ODS statements.
Assessments:-  Assessment of SAS

SAS-Error Handling  

Identify and resolve programming logic errors, recognize and correct syntax errors, Examine and resolve data errors.
Assessments:-  Assessment of SAS

SQL  

Basics of Database, Types of Databases, DDL, DML, Select, Insert, Update, Delete
Joins, nth highest
Assessments:-  Assessment of SQL

Assignment  

06 Assignment;One Each Topics

Test Series  

01 Full Test

PROJECT & TRAINING

Our live-projects offering prepares you for a range of analytics offerings in data science domain. For this course we would work on these projects:

  1. Time Series Analysis:  Forecasting the stock price data using different time series algorithms like ARIMA, HW, EWMA etc in R
  2. Text Analytics:  Appling Text Analytics on text data (twitter, online) and calculating polarization, complex words, fog index, text clusteting, text classification in Python
  3. CNN on Image Data:  Applying deep learning in image processing, classification and identifying features in Python
  4. Data Cleaning and Manipulation in R and preparing different visualization on Retail data
  5. Building Data Pipeline (Hadoop|Hive|Spark):  Building a data pipeline using RDBMS, creating aggration engine in HIVE and Spark-SQL and final visualization in Tableau of the aggregated data using BFSI data
  6. Data visualization:  Country wise population and literacy rate analysis at district level. Geo-Spatial Analysis, action filters, basic design elements, calculated fields etc would be covered in this exercise
  7. Soft Skills:  Group projects where students will apply their learn skills by planning, organizing and documenting and giving presentation on the given topics and training they have undertaken.
  8. SAS/SQL:  Implementation of statistical algorithms using SAS using one industry data (for ex: retail industry data)

SAMPLE CERTIFICATE

 

USP OF PROGRAM

Curriculum created by the industry experts in collaboration with NASSCOM keeping in mind the industry needs
State-of-the-art infrastructure and fully equipped labs
Training delivered by certified and experienced trainers and industry experts
100% Instructor-Led Classroom Training
NASSCOM SSC official study material
Assessment & Certification from NASSCOM IT-ITeS
Globally Recognized Certificate
Interview Preparation.
Placement Assistance.

FAQ

What is this program about ?

All the tools which are necessary for analytics/ data science profile like programming and data modelling tools (R, Python, SAS), Big data tools (Hadoop, Hive, Pig, Spark, and many more), Visualization and storytelling tools (Tableau, MS Excel etc.) and data management tools (SQL and MS Excel) will be covered in detail. For the topics, please look at the detailed curriculum, however as stated above, everything which is required to start analytics/ data science journey.

What is course duration ?

The course duration is approximately 500 hours

What all topics and tools will be covered in this program ?

This will cover a range of platforms like Python, R, Hadoop along with Machine learning, Big Data and other key elements of Data Science

Why join this course ?

Data Science is one of most pursued careers these days and requires a holistic understanding to deliver right business solutions. This course will prepare you for all these business challenges you will encounter in your career as a data analyst.