sem 5 PDS
Python, JavaScript, R, SQL
Box plot - this graph represents the minimum, maximum, median, first quartile, and third quartile of the data set.
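The five values named above can be computed directly. The following is a minimal sketch using only the standard library; the sample data is hypothetical, and the "inclusive" quartile method is one of several conventions, so other tools may report slightly different quartiles.

```python
import statistics

# Hypothetical sample data, chosen only for illustration.
data = [2, 4, 4, 5, 7, 8, 9, 11, 12]

# With n=4, statistics.quantiles returns the three quartile cut points.
# method="inclusive" treats the data as the whole population (an assumption).
q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")

five_number_summary = {
    "min": min(data),
    "q1": q1,
    "median": median,
    "q3": q3,
    "max": max(data),
}
print(five_number_summary)
```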
Python
Introduction to Python
- Open source, interpreted, high-level, general-purpose programming language.
- Widely regarded as easy to learn and pleasant to use.
- Often needs fewer lines of code than other languages for the same task.
Advantages
- Easy syntax
- Readability
- High-level language
- Free and open source
- Cross-platform
- Widely supported
- Safe: memory is managed automatically, with no manual pointer handling
- Huge number of additional open source libraries
Data structures in Python -
List - a mutable data structure that holds an ordered collection of items. It allows duplicate values. []
Tuple - an immutable ordered sequence of zero or more objects. It allows duplicate values, like a list. ()
Set - a mutable data structure that holds an unordered collection of unique items. {}
Dictionary - a mutable collection of zero or more key-value pairs with unique keys. Since Python 3.7 it preserves insertion order. {key: value}
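The four structures can be compared side by side. This is a small illustrative sketch; the variable names and values are made up for the example.

```python
# List: ordered, mutable, allows duplicates.
fruits = ["apple", "banana", "apple"]
fruits.append("cherry")

# Tuple: ordered, immutable, allows duplicates.
point = (3, 4, 3)
tuple_is_immutable = False
try:
    point[0] = 10  # item assignment is not allowed on a tuple
except TypeError:
    tuple_is_immutable = True

# Set: mutable, unordered, keeps only unique items.
unique_fruits = set(fruits)

# Dictionary: mutable mapping of unique keys to values.
stock = {"apple": 2, "banana": 1}
stock["cherry"] = 1

print(fruits, point, unique_fruits, stock)
```

Note that {} on its own creates an empty dictionary; an empty set is written set().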
Data Science -
It is a branch of computer science in which we study how to store, use, and analyze data to derive information from it.
It is about gathering data, analyzing it, and making decisions based on it.
Creating a data science pipeline -
- Preparing the data
- Performing data analysis
- Learning from data
- Visualizing
- Obtaining insights and data products
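The pipeline steps above can be sketched on a toy example. This is only an illustration under stated assumptions: the data (hours studied against exam score) is hypothetical, the "learning" step is a plain least-squares line, and the "visualization" is a text bar chart rather than a real plot.

```python
import statistics

# Hypothetical raw data: (hours studied, exam score); None marks a missing value.
raw = [(1, 52), (2, 58), (3, None), (4, 71), (5, 77)]

# 1. Prepare the data: drop incomplete records.
clean = [(h, s) for h, s in raw if s is not None]

# 2. Analyze: basic descriptive statistics.
hours = [h for h, _ in clean]
scores = [s for _, s in clean]
mean_hours = statistics.mean(hours)
mean_score = statistics.mean(scores)

# 3. Learn: fit score = slope * hours + intercept by least squares.
numerator = sum((h - mean_hours) * (s - mean_score) for h, s in clean)
denominator = sum((h - mean_hours) ** 2 for h in hours)
slope = numerator / denominator
intercept = mean_score - slope * mean_hours

# 4. Visualize: a crude text bar chart, one '#' per 5 points.
for h, s in clean:
    print(f"{h} h | {'#' * (s // 5)} {s}")

# 5. Obtain an insight: estimate the missing score at 3 hours.
predicted = slope * 3 + intercept
print(f"Estimated score for 3 hours: {predicted:.1f}")
```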
NumPy
NumPy stands for Numerical Python.
NumPy is a Python library used for working with arrays.
It has functions for working with linear algebra, Fourier transforms, and matrices.
Using NumPy, mathematical and logical operations on arrays can be performed very easily.
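A short sketch of the kinds of operations mentioned above, assuming NumPy is installed; the arrays are arbitrary example values.

```python
import numpy as np

a = np.array([[1, 2], [3, 4]])
b = np.array([[10, 20], [30, 40]])

# Mathematical operation: element-wise addition.
elementwise_sum = a + b

# Logical operation: boolean mask of entries greater than 2.
mask = a > 2

# Linear algebra: matrix product and determinant.
matrix_product = a @ b
determinant = np.linalg.det(a)

# Fourier transform of a unit impulse (all frequency components equal 1).
spectrum = np.fft.fft(np.array([1.0, 0.0, 0.0, 0.0]))

print(elementwise_sum, mask, matrix_product, determinant, spectrum, sep="\n")
```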
pandas
pandas is a fast, powerful, flexible, and easy-to-use open source data analysis and manipulation tool, built on top of the Python programming language.
Series - a one-dimensional array with axis labels.
DataFrame - a two-dimensional labeled data structure with rows and columns.
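The two structures can be illustrated as follows, assuming pandas is installed; the names and scores are hypothetical.

```python
import pandas as pd

# Series: one-dimensional values with an index of labels.
scores = pd.Series([85, 92, 78], index=["alice", "bob", "carol"])
bob_score = scores["bob"]  # look up a value by its label

# DataFrame: two-dimensional table with labeled columns.
df = pd.DataFrame(
    {
        "name": ["alice", "bob", "carol"],
        "score": [85, 92, 78],
    }
)

mean_score = df["score"].mean()   # column-wise statistic
high = df[df["score"] > 80]       # filter rows with a boolean condition

print(bob_score, mean_score)
print(high)
```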
Beautiful Soup -
A Python library that makes it easy to scrape information from web pages.
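A minimal sketch of scraping with Beautiful Soup, assuming the bs4 package is installed. The HTML string and its URLs are made up for the example; a real scraper would first download the page.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML snippet standing in for a downloaded web page.
html = """
<html><body>
  <h1>Course notes</h1>
  <a href="https://example.com/numpy">NumPy</a>
  <a href="https://example.com/pandas">pandas</a>
</body></html>
"""

# Parse with Python's built-in HTML parser.
soup = BeautifulSoup(html, "html.parser")

# Extract the heading text and every link target.
title = soup.h1.get_text()
links = [a["href"] for a in soup.find_all("a")]

print(title)
print(links)
```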