 Good morning. First, let me introduce myself. I'm Dr. Simha Chalam, working as an assistant professor at NIRD. And I bring to this particular institute, National Institute of Rural Alignment and Panchayatira's Government of India. My background is genophomatics only. I deal with genophomatics about GIS, remote sensing, GPS, and its applications in various sectors. So this particular field is very, I think, essential of this field, you know, because since you're doing, you're all aware why this is important. Here today we have two units will be covered under the course three of special analysis and application in genophomatics. Under this course, there are two units needs to be covered. The one is special data modeling and database management. The unit two will be data analysis. These two units are very, very important. In fact, data generation, creation, the analysis, which is very, very important, how, what this in this process just will be knowing it. So let us start with the unit one. It's on special data modeling and database management. Do you have any idea about the special data? To whom we call special data? Special data means what? You're not there in my last class, no? No, no? Okay. Special data means data which is having latitude, longitude, or another word is geographical data. Geographical data go special data both. For example, this is my, for example, one small village, revenue village, right? So here there is a road networking. Okay. This is one hamlet in my village. Another hamlet. Okay. And here also we have one more hamlet. What we are seeing in the map, these are all special features or special data because this data is there on the earth. For example, road is special data because it's there on the earth. It got the latitude, longitude and house special data, building special data, forest land, water body, anything which we can see with our eye. These are all called special data, which is there on the earth. You're all aware that the entire globe have been divided into 180 latitudes and 360 longitude values. Where we measure where we are on the earth or anything on the earth, where it lies. It has a location. Where is it? Is it a road, a tree, a building, an institute, or any agricultural land, a water body. These are all geographical features, right? These geographical features where it lies on the earth will be addressed with the help of latitude, longitude only. Aksham Shraikams says that we are aware of that. We have said that there is something on the earth. It is called geotagging. So what GIS basically do is, this geographical data will be represented in the form of map with the help of latitude, longitude. Where the latitude, longitude, you can see the data with the help of latitude, longitude in GIS. This data called special data, data which is there on the earth. These special data modeling and database management. These particular special data, how it can be modeled. There are three kinds of data modeling we have in GIS. One is vector data model, second one is raster data model. Let us see what is those. But before that, as a course you have to understand what is spatial analysis. Spatial analysis, which is very, very important. Spatial analysis has a set of tools used for analyzing spatial data. The quantitative study of phenomena comprises. Phenomena located in space can be termed as spatial analysis. Then the spatial analysis means very simple. Now you know spatial data, these are all spatial data. Whatever we are looking at above the Google, it is called spatial data. It means the quantitative study of phenomena located in space can be termed as spatial analysis. Quantification means measurement. In your revenue village particularly, the village which we located, how much land under agriculture. It is analysis. How much land under water body? It is a spatial analysis. How much land under forest? It is a spatial analysis. How much land under settlement? Quantification. This quantification when you make it for your entire village is crop land. For example, 100 hectares. For example, 20 hectares. When you analyze in a similar way, you have fallow land or wasteland. 10 hectares. Water body 10 hectares. These are called spatial analysis when you quantify. But not in a tabular form. In GIS, as you know, both the data can be seen. One is spatial and the other one is attribute of non-spatial data. It is called a tabular form which is looking at your numbers. It is called non-spatial data. Spatial data means it is called spatial data. It is called spatial data. So, you will be able to see everything here. In spatial analysis, you will be able to see everything. If you click on this particular feature, suppose this is the path. If you click on this path, what is the length of the road? For example, how much is the total green road in our village, block or district? How much is the total paved road, for example? When you quantify it with simple color, you can see it in the map. These are also spatial analysis. When you see these numbers, quantification is called spatial analysis. Basically quantification of your phenomena located in space called termed as spatial analysis. This quantification is very important for planning, monitoring, etc. It could be a research purpose. When you go for the quantification of your spatial, that is very important. Otherwise, we don't know because you have to plan for rural roads. You have to plan in a block. So, how many villages are there in a block? How many villages are there in a block? How many villages are there in a block? And then what is the length of the road that is required for each village? For that, spatial analysis is very important. If you demarcate a block with your boundary, then all existing villages will be demarcated and then linkage roads to those villages. What are all roads having in that particular village will be demarcated through the map? When you demarcate the map, all the existing roads in the map format, JAs will give you automatically length of the road. Each and every village connected with the road, what is the total length required for plan, which you have the kaccha converted into pakka using different schemes? Maybe Pradhananthi, Gamsadak, Mahatma Gandhi, Narega, maybe 15th finance commission. There are different schemes we have in our rural development sector or in any other sector to develop particular road networking that can be used for develop the roads. So, how does JAs work for that? It is very important to do analysis. Exactly where you need it, how much it will be known. Not only roads, anything. Maybe kaccha houses, pakka houses, maybe agricultural land, fallow land, maybe water bodies and rivers etc. How much is perennial, how much is seasonal? Everything specially you can classify here and the quantification will be given as a tabular form by the JAs. It is called special analysis. And it depends on the location of the objects being analyzed and the process examining the locations, attributes and relationship of futures in special data is made possible through the overlay analysis. There are different analysis tools we have. We will discuss what kind of analysis we have. When it comes to the data model, right? Which is very essential component of any operational JAs, data model. Data means what? Normally you forget about JAs. Commonly who do we call data? It is a special data, different. But you know data, right? Who will call data? Informations, very good. Informations means? Right, you are right. Information means what? For your kind of information, I mean for your knowledge I say this data and database and information. Three are look like same but the source is same but terminology should be used in differences differently, right? Data when we call, this is different. After data we create database. Then information. Now come to the data. Data means raw facts and figures called data, RAW. Raw facts and figures called data. After collecting your data, you create database. Database means an organized collection of data. An organized collection of data. After that you will get information. Data means processed data called an information. Processed data called an information. Sir, what is this? Everything is data. Right? Yes, exactly. These, for example, somebody is going to teach or any immediate to collect the data. Census data example. Census data is collected from where? Every house is collected. What is the owner of the house? How many people are staying? How many are male? How many are female? How is its literacy rate? How many are A's? Right? Occupation, land holding, housing, amenities. We collect number of things, you know, when we go for best land survey data or census data. So your friend, for example, is a teacher. He is going to get census data. You are asking, where are you going? Very outgoing. Then he or she will tell that why I am going to get census data. Please remind that never use... No, no, I am going to collect census information. Who uses it? Never. Never use census database. They use only census data collection. Because they are going to collect the data which is raw facts and figures. Just raw facts and figures they are going to collect. This is called data. And then second one is database. When they come back, they put in a database format organized manner like CDL number, head of the household name, total population, total male, total female, right? And then occupation, then housing amenities like whether electricity is available or not, drinking water available or not, water, sedentary litter available or not. These are all, when you put in an organized way with the CDL number of the house in that particular device or GP, this is called database. After getting it, the proper manner when you organize it is called database. Then finally use your, you never use directly data, database in your projects or in your study. We use in study only information which comes from data and database only. The information means process data. As I said, if you want to plan health facility in your block or district, then you have to know your, what are all existing PhDs we have in my district so that I can find out the gap. So what I have to do? I have to process entire GP, entire district PhDs, available PhDs. No, no, I want to plan only for, for example, I want to know how many female population we have. Then I have to process entire GP if you want to at GP level. Process means the whole house, the whole population of the house, the whole process of female male will be different. No, no, no, I want only the female with more than 60 years if I want to know. For different scheme purpose, maybe NSAP, National Social Essence Program we have where after 60 years they get the pension from the government. Then you have to process the entire database with their A's which you have in your database. Then you will get information. So process data called information which is very useful. Finally we use only information in your projects. You do any thesis or any project, common project also. Directly data we never use, database we never use. We use only information which comes from data and database. So data, database, information. These three are very important and here the similar way in GIS we called spatial data. Similar way data, for example, this is a data. Roads are data. Settlements are data. Water bodies are data. Boundaries are data. This is spatial data, data which is there on the earth. Geographical data, spatial data. Same. So this spatial data will be represented in GIS in different ways. One is vector and second one is raster. So data models just we will see. This is characteristic of data, means different types of spatial data will be like 2D we have, 3D data we have, then discrete or continuous data we have. Different features are not same. Some features will be point. Some will be line, linear features like road, stream, river, etc. Some will be polygon or area. For example, I want to see water body, it's area concept. I want to see forest, forest land, it's area concept. So three kinds of data will be represented, especially in spatial data and also we have 2D and 3D data also. Spatial data will be represented in both the ways. Three dimensional view, exactly how your physiography look like in your study area or geographical area. Second one is 2D. One is 3D, second one is 2D which we see in the map form. So both the data can be visualized, spatial data can be visualized in the spatial analysis. There are different sources of spatial data like data acquisition from different sources like conventional along with the map sources and aerial and remote sensing satellite remote sensing field data existing. One is spatial data source, another one is non-spatial data source. Spatial data means this is the spatial data. Sir, road information, water body information. We have to go and draw and we can do mapping in GIS. So this is the source of spatial data. Satellite remote sensing as a background, using GIS you will demarcate it. You will map it. Second option will be aerial photography which you will get even three dimensional view. Z value also you will see in aerial photography imagery. Then we have some field sources like GPS is the main source to collect field data, spatial data. GPS like a platitude longitude can be collected. So spatial data can be collected through the GPS also. Satellite map sources like there are number of web portals now available. For spatial data it is not just to generate, it is already generated. Like Bhuvan is given by NRS National Remote Sensing Center. This type it in the Google it then you will get various applications specially directly you can visualize by the Bhuvan. And also we have USGS American portal. Satellite data free of cost can be downloaded from Bhuvan. Then earth explorer where you can download three dimensional view. Dem data also you can download it. Even Google it is another source which is open source but still you can able to visualize the spatial data. So these are all different sources of spatial data where you can able to visualize and extract and these sources using these sources you can create new spatial data in your respective study area GIS software. Then as I said we have two types of spatial data models. When we use to analyze spatial data basically we use GIS software. In this two kinds of spatial data models are available. One is raster another one is vector spatial data model. What do you mean by raster? What do you mean by the vector? Just we see because this is important when you prepare for examinations there are types of spatial data. What is spatial data? What is spatial analysis? Then come to the spatial data models. There are two types of spatial data models available. One is raster second model is vector. In raster data model you represent all data's represent with the pixel form. Pixel you understand no? Pixel means mobile may use 10 megapixel or 20 megapixel what does it mean? Why the 20 megapixel camera mobile are costly than 10 megapixel? Anybody? Pixel means what definition of the pixel? Pixel means picture element. Pixel means picture element. These are all cells which are saying it is nothing but a pixel. Your raster data will be shown this way. Grid form pixel means picture element. Pixel means cells. You get this one. Even your TV if you go to home and see closely then your TV also picture represents in a raster form which made with the pixels. Pixel means picture element. So this pixel size is if it is big then the clarity will be less. If pixel size is small, clarity will be high. Whereas number of pixels same unit of area, number of pixels is so high. 10 is 20, 30, 50. If it is 50 same area then small pixels will be bigger. Obvious is depends on the clarity your picture depends on the clarity based on this pixel size only which calls raster form. In GIS also data will be represented in raster form for pixels like your toposheet satellite emissary, map and all when you scan it it comes as a raster form. If I want to represent this road in the raster form see the whole pixel will have to see how my road is going. All pixels will be marked with one color. Not in exact position or location. Suppose road from here to here road I want to represent in the raster form in the pixel form it will be marked. Whereas same road if you want to represent in vector form in vector form basically data will be represented in three ways point, line, polygon. One is point, line, polygon. Here point for example is a building specific point. In case of this road line exactly it will go line. Here it will not vary but road is going like this but it will represent entire pixel in raster form. So vector form is very clear than the raster form. It occupies very less space and accuracy is very high in vector form than the raster form. Then this is the polygon area water body etc. So these two are very important raster. You can see here sorry what is the raster here it is clearly given. It is a various called a grid or raster map or surface overlay. Raster represents continuous surface with the help of discrete grids or cells which we have seen in the picture. But for data stored and analysis a raster is divided into rows and columns forming cells. Cells are called as pixels which I just said what is the pixel. It is a picture element and then if you see the vector data model it is like point line polygon which will be used for representing your entire vector special data. The point has dimension had only one property of location one x and one y. One latitude and one longitude. The various line is having series of latitude longitude value. The polygon also having a series of flat long value but when it starts to closed line we call closed line called as a polygon. But each and every point of the line you will have the latitude and longitude value. So when you design draw a map both will be used. When you do special analysis vector is very clear accuracy than the raster for special analysis also representation also in a data model. Always you prefer for vector data model only for putting into the map form using the GIS. But sir why then raster for example I want to 3D view I cannot extract in the vector form. I want to do some analysis clubbing, merging, overlying some other special data like land use, land power soil, dryness. Different layer analysis may be kept together then I have to use the raster form only not in the vector form. So these two are very important in special data. Vector model and raster model. Please remember these things how it will be represented over data vector and raster. These are all some of the data models used in GIS like CAD graphical, image raster. These are data model. Network model, geo relational tin object. If you see out of this we have raster format will be around 2 or 3 only. 2 types only. That is image format and raster grid. The image format or .img is a raster format. When basically national remote sensing agency NRSC provide your satellite data in image format. We have one more format called TIFF .TIFF format too. This TIFF format also will be called as a raster format like your photograph scan to push it and all will be stored in TIFF format. This is called as raster. Otherwise all you see in the vector format which object oriented was the geographical point oriented. Everything will be shown as a vector format in GIS. 90% data will be used as a vector in GIS analysis. Remember that. Then the database management technology in GIS. There are many more data bases. We use SQL server. We use many more. But basically in GIS when we use special data we have. One is special data. The other one is non-special data. As I said special data means which you see here in the map will form. Like this line point polygon which demarcated in the map form we called special data. Whereas non-special or attribute data means about the data. For example, there is a road this is special data. Here about this road number of attributes will be given in the GIS. If you click on this road like name of the road whether it is national highway state highway, district highway what is this. And then type of the road another attribute black top, gravel road what kind of road it is. And then condition of the road present condition bad condition, good condition need to replace something. In which year it was constructed which scheme it was constructed what is the width what is the length the connected from point 1 to point 2 point A to point B. I mean from way to way this road is connected these are all called attributes or non-special data. GIS when you create a database both data will be there you can see there a database how it is managed as attribute 1, attribute 2 attribute 3 which you are saying attribute 1, 2, 3, 4 like that. You have shape that is polygon or line at point right anything on the point for example one point of the road which is line we are saying about this line you have number of attributes like this both data is linked together using GIS software. Special and attribute data are non-special data. Special data means graphical data using point line polygon it will be demarcated which you have all geographical entities and attribute data means about that particular data quantification you say quality you say which can be demarcated or linked called non-special or attribute data this way right. So, database management will be GIS basically two ways like this right. Then they are all different standard database we have and data definition and adding records query is another very important analysis factor in GIS we will see what kind of query we have the deleting records data security data dictionary query many more what are they and these are the some DBMS which being used in GIS starting from Microsoft Access Oracle Cybase, SQL Server DB2 there are many more the basically we use Microsoft Access and Oracle and then SQL Server also for when we go for WebJS etc. So, under this database management we use geo-referencing of the object I think what is geo-referencing you might have read already geo-referencing for example I scan some toposhe when I scan it some false values will come means the false values of the grid as I said earlier when you scan a toposhe it will come in the grid form. This pixel should have a value when you scan it it comes 00 right. But if you do the geo-referencing of the objects what you do you will assign the original latitude, longitude of the earth exactly the toposhe where it is from the earth that latlong when you assign four corners using this particular process the entire pixel will get latlong value the whole data gets latlong value this process called geo-referencing in GIS without geo-referencing without latitude longitude you cannot do any analysis you cannot do any mapping these two are very important so the data which you have it may be satellite image it may be toposhe it may be any kind of reference data that must have done first geo-referencing which got the latitude longitude once you go to the geo-referencing the data will get automatically latitude longitude right in raster form each never pixel get the latlong so this process is very very important the coordinate system after geo-referencing we have a map projection that we give basically WS-84 that is World Geographic Survey we enter the coordinate system once you do the coordinate system the data exactly goes and sit where it is on the earth that will be reflected with the latlong value and after that in GIS once you create the data you do the geo-referencing in projection you have to extract the data from raster to vector while digitizing there may be manual digitization right automatic digitizing tool there are many more tools are there to extract from convert from raster to vector using this many more options one is manual digitization right like low capital cost just directly while seeing backdrop of satellite image or toposhe you can digitize and then we have automatic scanning also the another data extraction tool and then entry of coordinates using coordinate geometry you can enter latlong value so that things will be extracted and conversion of existing digital data is another technique becoming increasing popularly data input to conversion of existing digital data directly raster to vector it will convert whole data so new tool has come up but it was not there but when you use the first option the manual digitization is the very high accurate option where you can accurately extract your special features from raster to vector right this is very very important for generating any data type or data model in GIS environment so that is all about the first unit data input and methods which we have to input your data upto here we do we digitize and then you also go for editing once you digitize this or scaling this you have to do some editing editing means what normal editing means what corrections no editing what we do corrections no in GIS also when you digitize what process for example my study is upto here but when I digitize by mistake upto here it gone this portion is not mine right in editing what I do I cut this one remove this one and sometimes when I digitizing road but I given a gap in editing I have to mix with that road it should match it should connect so these called overlapping under shoots over shoots this process has to be done for under editing part so then once you do the editing then final data will come otherwise after extraction input digitization right or anything scanning digitization anything option use that extraction then you have to do the editing also in editing final output will be shown and also you have attribute editing also by mistake I given culture road but it is culture road when I collected from GPS or other records then in attribute editing also you can convert into park road this is non-special editing this is special editing editing both editing can be done one is special editing and another one is non-special editing right the collection of the errors called editing special and non-special both that is the final I mean editing is the final option in geo database creation special analysis and then database creation sorry data model, data input, data extraction then editing this is entire process in GIS