WEBVTT

00:00.080 --> 00:01.000
Welcome back.

00:01.040 --> 00:06.160
We installed and imported successfully the pandas library.

00:06.200 --> 00:10.000
Now let's move and use it in our notebook.

00:10.200 --> 00:15.720
The main object in pandas is the data frame, a 2D table with rows and columns.

00:15.960 --> 00:22.960
In order to create a simple data frame, start with the name of the variable and the equal sign.

00:23.000 --> 00:28.840
Open the brackets and inside it you notice that it's similar to the dictionary.

00:28.880 --> 00:39.160
We have the key and the value separated by the columns, and we have the key value, key value and key

00:39.360 --> 00:40.720
value pairs.

00:40.760 --> 00:41.240
Okay.

00:41.480 --> 00:43.800
So the first key is name.

00:44.000 --> 00:46.960
The value is an array of strings.

00:46.960 --> 00:49.360
The second key is age.

00:49.520 --> 00:52.720
The value is an integer.

00:53.920 --> 00:55.240
The third key.

00:55.720 --> 01:01.480
The third value of the key city is an array of strings.

01:01.720 --> 01:09.120
The fourth one is grade, and the value is array of integers or list of integers.

01:09.520 --> 01:10.440
It's very simple.

01:10.440 --> 01:12.360
This is how we create the data.

01:12.360 --> 01:14.520
It's similar to dictionary.

01:14.560 --> 01:18.750
Okay, think about it like This again.

01:20.070 --> 01:27.270
Data frame primarily data structure for tabular data like Excel spreadsheets.

01:27.510 --> 01:31.790
One dimensional array like object for single columns.

01:32.070 --> 01:33.910
Data cleaning and transformation.

01:33.910 --> 01:36.470
Those are the key features of Panda.

01:36.510 --> 01:39.150
Okay, so think about it like.

01:41.510 --> 01:42.270
Dictionary.

01:43.590 --> 01:48.990
Now let's use this data frame df which is data frame.

01:51.030 --> 01:53.310
Equals to PD.

01:53.830 --> 01:54.470
PD.

01:55.070 --> 01:55.870
As pd.

01:56.390 --> 01:58.550
So I'm using this library.

01:58.550 --> 02:07.390
And here I am using the data frame in order to pass the dictionary to it again.

02:07.510 --> 02:11.390
PD is the alias of pandas.

02:11.430 --> 02:14.590
The shortcut of the pandas data frame.

02:14.750 --> 02:18.510
This is the pandas main data structure.

02:18.550 --> 02:24.190
A 2D dimensional table with rows and columns data.

02:24.390 --> 02:29.230
This is the input data that will be converted into data frame.

02:29.270 --> 02:40.490
DF the assigning that this assigns that created data frame to a variable called D that would be referenced

02:40.490 --> 02:41.250
later.

02:41.370 --> 02:45.490
So here we are displaying the data frame df.

02:45.770 --> 02:48.530
Let me run this and here we go.

02:48.770 --> 02:51.690
We have this visualization.

02:51.690 --> 02:56.850
This is our table name age city grade 0123.

02:57.130 --> 02:58.530
And those are.

03:02.130 --> 03:10.010
This is very library we're going to use in uh in this course and later in the next courses.

03:10.170 --> 03:12.890
So I like this.

03:13.450 --> 03:15.850
You have many features.

03:16.170 --> 03:19.250
Convert this data frame into an interactive.

03:19.650 --> 03:21.250
You can notice like this.

03:25.490 --> 03:30.050
You can see the distributions time scale and others.

03:30.090 --> 03:32.450
This is a very important library.

03:32.690 --> 03:40.130
You may be use it in your in your projects and your university and others.

03:40.170 --> 03:40.570
Okay.

03:40.610 --> 03:42.210
So clear output.

03:42.250 --> 03:44.770
Run it again and here we go.

03:44.810 --> 03:49.210
This is how we get the table from data.
