 Hello, Welcome to SSUnitex Succeed this side and this is continuation of PySpark tutorial. So today we are going to see about the Select inside the PySpark. So today's agenda is first we will see how we can use the Select and second we will see about the alias. So how we can provide any alias of any column. So let me quickly go inside the browser and we will try to see in practical. So here as we can see we are having this df data frame and we are loading the data from the csv file and after that we are just renaming one of the column which is the item name. So we have renamed from the item name here we have space we have just removed the space from here. So now as we could see we are having total 6 columns on this data frame. Now if we want to select all the columns then simply we can use the display df as you can see here but another way we can also use that is the Select. So while we are specifying the Select inside that we should be going to specify the columns that we want. So if you want all the column either you can specify one by one like the SOID then SODATE then item code then item name then quantity and then value. Now let me try to execute. So we should be going to see all these columns in the output. So as we could see here. So let's assume if our requirement is we don't want to get the item codes in the output then we simply go here and try to remove the item code from the Select statement. Now let me try to execute it again. So this time it should be going to display only 5 columns. So let me execute it again. So as you could see we are having this SOID SODATE and item code has not been here. So that is gone. So this is the first way. One way instead of specifying all these columns if you want to select all the columns we can specify ASTHIC as well. So let me execute. So it will be going to return all those columns. So as we can see SOID SODATE, item code all these 6 columns are here. Now the next thing you need to understand if we want to make any column as a calculative column. So what does it mean? So here as we can see we are having the quantity and value. So this value is the unit price for that particular item. So if we want to calculate the amount, how the cost of that particular item then we should be going to multiply quantity with the value. So how we can do that? So here simply let me try to select all the columns and after that we want an additional column. So that additional column will be DF dot your quantity then multiply DF dot value. Now let me try to execute. So it should be going to add one more column here and that is the quantity and value. So let me try to scroll right side and we can see. And here we can see the multiplication of the quantity and value is here. So one additional column has been added in this data frame and that is quantity multiplication with the value that we are doing over here. But remember as you can see the column name is not proper. The column name we want as amount. So simply we can provide the alias name of this column. So how we can do that? Here we should be going to use the alias. Now here let me try to specify as amount. Now let me try to execute it. So what we should be seeing? We should be seeing one more additional column here and amount. So the column name has been renamed. So simply you can use the selected statement along with the alias name if you want to specify the alias of any particular column. And here simply you can specify all those columns that you want in the output. So I hope guys you have understand how we can use the selected statement along with the alias. Thank you so much for watching this video. See you in the next video.