sem 5 PDS

python , javascript, R ,SQL

This graph represents the minimum, maximum, median, first quartile and third quartile in the data set

 python 

introduction of python 

  • open source, interpreted, high level , general purpose programming language.
  • easiest to learn and nicest to use
  • fewer lines of code compare to other languages

Advantages 

  • easy syntax 
  • readability 
  • high level language 
  • its free and open source 
  • cross platform
  • widely supported 
  • its safe
  • huge amount of additional open source library 

data stuctures in python -

List - list is data structure that holds ordered collection of item.
tuple - tuple is imutable ordered sequence of zero or more objects . it allows dublicate values like list
set- is a muteble data structure that holds unordered collection of unique items. {}
directory - is unordered collection of zero or more key-value pair . its mutable 

Data Science -
it is branch of computer science where we study how to store , use and analyze data for deriving information from it .

it is about gethering , analizing and desision making .


Creating data science pipeline -
- preparing the data
-performing data analisis 
- learning from data
- visualizing 
- obtaining insights and data product


Numpy

NumPy stands for Numerical Python.

NumPy is a Python library used for working with arrays.

It has functions for working with linear algebra, fourier transform, and matrices.


Using numpy , mathematical and logical operation on arrays can be perfomed very easily

pandas

Pandas is fast , powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of python programming language .

series - is one dimensional array with axis label 
data frames - are two dimensional data structure 

Beautiful soup -
is a library that makes easy to scrape information from web pages .

Comments