 Hello everyone, welcome to NPTEL course on rural water resource management. This is week 11 lecture 1. In the past weeks, we have been looking at how groundwater, rural water, surface water can be preserved using natural and engineered settings. Basically, in week 10, we looked at natural rainwater harvesting, wherein we capture the surface water and then slow it down using nature-based solutions. Rainwater comes in big volumes sometimes and in quick time, which means in a concentrated time. So it is better to capture this rainfall and then slowly recharge it and slowly let it grow as surface runoff. Otherwise, all the water would wash away and erode the soil and also reduce your groundwater. Then we looked at methods to store this water using engineered solutions and nature-based solutions. In some locations, there is not much nature-based solutions you could do or the scale is too big. So you have engineered solutions like dams and other resources infrastructure. In week 10, we focused mostly on nature-based solutions and we looked at where naturally we can store water like in depressions, lakes, ponds, etc. We also looked at the option of using vegetation to slow down the water because your vegetation acts like a buffer or like a challenge the forceful flow of water. It slows down and then lets water to recharge. Also, there is some storage in the plants area which can actually help in storing water like a wetland atmosphere. Then we looked at regions where there has been loss of biodiversity, loss of forest, plants and then we looked at options to reintroduce forests in these locations, especially the afforestation and agroforestry. We looked at Miyawaki method where we brought small plants, take them in shade and then introduce them in the wild. All these require a lot of time and maintenance up to a particular point after which the system can be involved. So with this, we complete the Rural Water Resource Management lecture based on the principles, the fundamentals, the drivers, everything has been discussed. Now we will look at the databases. So with the understanding you have created over the 10 weeks, it is important to have data to work on and that is where we will be focusing on week 11 and week 12. Basically, we will be looking at where we can get the data. What are the cautious points that we have to follow while downloading these data and what is the principle use of these data? So Rural Water Database can be obtained from multiple sources. I am going to pull up the pointer to just show that. NGOs collect a lot of data. So we will look at how NGOs collect data, why they collect data in this lecture. Papers, journal articles like academics like me where we write a lot of papers to show the scientific scholar at activities and also work. You will look at where we store these data. Then the next step is the government agencies because NGOs are private non-government agencies then when you go to the government agencies there are two major types your state agencies and the central agencies. The state agencies have their own budgets, small scale budgets compared to the central government and they work on the state's need. They work very focussely on the state water resources and under the state water resources rural water resources is key. Then we go to central agencies where we have thematic agencies. For example, we have climate change and forestry, we have water resources, gelgeven mission, all these different agencies within the government. How they collect data, where do they store it and how can you access it. Then we will also be looking at remote sensing platform because some of the databases I have mentioned may not be able to capture the entire spatial and temporal resolution. Spatial means across India every single district can you collect data is a question or even every single block. Also some of these data are not collected daily whereas your remote sensing can at least give you weekly estimates. So in this lecture series, we are not going to pick one and say this is the best because it depends on your location and where you want to do your research for the time period. Also the research question. For example, if I am looking at groundwater for rural areas, the observation record is only once every four months. So can I augment it, can I add remote sensing data is the key. Then we look at central water agencies where remote sensing is also a part. For example, ISRO is a central agency, government agency and it has a lot of remote sensing products which the government agencies are using. So it's not one data is better than the other. How can we use them together? I will discuss in this and the week after this lecture, which is the last week. So we are almost coming to the end of the rural water resource management course and I would like to focus on the principles that we discussed in class are enough to at least create the basic understanding and start managing the water resources. To have hands-on experience, we are looking at these data sets. So let's get into today's lecture on the first aspect which is your NGO and paper-based data. I will also, while completing this lecture, I will also go into the WRIS database and show exactly where these data are stored so that you can easily retrieve the data. For example, here I'm showing the groundwater aquifer type data across India and you could see that all this data can be downloaded and used for your water resource assessment. In fact, there are places in this region where some of the groundwater structures that we discussed, the natural reaches structures, the engineered structures cannot work. So we need to understand these potentials which the groundwater board has done and put it on these websites. I would say WRIS is a more centralized location of your data and I'll be happy to explain most important data sets and let you learn in due course how to use these data sets. So without further ado, the first rural water data resources we'll be looking at is publications. So publications are written by scientific scholars and students and who are active in research. Most of these are found in the Google Scholar. So basically, while Googling your data that you need, you can go to Google Scholar. Google Scholar is part of Google which focuses only on papers, journal articles. For example, you have Google Books, you have Google Scholar. Both are scientific knowledge products are kept. Let's say for example, if you want to go from one location to another location, you use Google Maps. So how you use Google Maps? Because it has a separate vertical under the Google umbrella only for maps. Similarly, you have one vertical for scientific publications and that is called Google Scholar. Here, you can take a lot of paper which has data. For example, I'll spend one more minute on this Google Scholar. I'm showing you a report that has come in Google Scholar and what you could do is you could get into these resources and collect the data. I'll also explain how you collect the data and how you cite it. So what do NGOs do? NGOs write also reports and publications and research manuals. These may not be accessed in Google Scholar. You should use only the Google to find it. Why is because most of the papers that are put in the Google Scholar database is peer review, which means it goes through a scientific process where the paper's content and the data, the results is checked by a couple of scientists or researchers or academics. They should give a thumbs up saying this paper is good. Then it comes into publication domain and then it goes to Google Scholar. So you first write a paper, you send it to a review process. The reviewers give comments, that is the peer review. And then once the paper is accepted, it goes to the publisher. After the publishing is done, it goes to Google Scholar. So that is what publications are. However, NGOs don't have to go through that route because this process is time consuming and they know that their data is good. It's a very localized data and not much science but applications of science principles and data. So for that, they would keep NGOs reports, publication manuals in open domain and publications, the research publications, paper publications sometimes require money to publish. So since the NGOs work on a very short budget and it is not a profit making company or industry, so they would publish in open domain, which is Google also. So I'll go through some of the reports and how they're doing it. In this week lecture, I'll also look at first state databases because state databases are huge and they have a lot of data that is collected and maintained rigorously in their work domain. So think about a massive server. A massive server can stay in the center of example, the capital Delhi and then have data for all the regions across India. However, the state can also have their own servers and from the state server, the state data can be housed and shared with public. So we will look at how this state databases are run, what are the data that they collect and how you can access it. Okay, so before we move on to the state data basis and an example, I would like to show one example of an NGO report. So here's one report and inside the report, what you would find is a lot of government archived data and they collect data. So similar the publications I mentioned where the scientific community writes proposals and collects data. The NGOs also do the same method of writing proposals and collects data. They analyze it using the own team scientific team and they put it in these websites or in this books. These books are kind of open source. It is not a hard copy. For example, it is stored in as a PDF in the database, which can be shared with everyone. So there are a lot of databases. Research Gate is one database where a lot of papers you can access and publications. Google Scholar, as I said, has all these publications. Then they write reports, annual reports. For example, every year what have they done in terms of securing funds and for the funding agency. Like for example, you have Melinda Gates Foundation, which is giving money for a rural water resource management work. The NGOs take the money, they work on the field and then they collect data, put it into the database and reports and publish it. It may not go into a scientific publication, but still it can go into a good report like this. Then you have the manuals, which they use for training and also building capacity. See NGOs do a lot of extension work, which means they connect people, they talk to people and they train them to manage the water resources. So for training, they collect data, they analyze data and they reproduce data. So all these work in the manuals, tutorials that they make and they publish it. Then the state databases, as I said, there are a lot of databases created for each state and some of the data are dam level. For example, you have the irrigation projects, which is we covered in lecture 9, week 9. We looked at engineered data, engineered infrastructures and we saw how dams help. So now we need to know the levels so that we know how much discharge is left, how much water comes in, what goes out of the system. So to create all these water budgets, the database is created at state level and dam level, discharge, rainfall, groundwater level is collected. This is at a state level and they also do a lot of dissemination. So let's look at focusing on the publications first. So each one I will go in detail. What can you get from publications? Please be careful on assessing a lot of content in the paper. See, the paper is not going to give you all the data you need. You will have to search for the data that you need. For example, the study data can be found as written inside the lines. For example, they will say the hydraulic conductivity of the aquifer is 10 meters per second. So that 10 value is the data you need for your groundwater assessment. In other words, they could say that, for example, this region in data watershed, they could say that the rainfall, annual rainfall is 655 millimeters. See, that number is because they put rain gauges, they collected rainfall in the village. So this was done by the MARV team. Do a lot of NGOs. So they collect the data and they write as a publication. So in the publication, you have this data as rainfall for DARTA is 655 millimeters. This data you may not get readily in government records or the state records because maybe they are not monitoring at that particular village. However, this study has done it. So what I am trying to tell is search for publications. Put the village name, put rainfall in Google Scholar. You may get some hits and use that hit to collect or make data. So this is kind of your data mining. I am getting into more data labels here. And data mining is a term where you go and mine, dig and take out data. So from the paper, you should have that time and expertise to collect data from a paper. Sometimes publications also give you a link to the data. For example, nature publications, hype and publications, they always have a link or a database linked to the paper because they want people to use it. All these are open source data. It is not a data which is stored in the government agencies because it is not funded by them. But the people who fund the paper or the study, they would like to see the data for public use, which is open source data. So you could go to these links, the paper links, click the data and use it. For example, this paper, Marvie, you could go into the funder's webpage, which is ACIR, and you can take the data and use it for your studies and understanding, etc. Also, some agencies put a mandate saying that the data has to be shared to public. For example, if I am writing a DST project, which is Department of Science and Technology, Government of India, they will say sometimes, please make the data available for public. So how do you do it is you create these data as an Excel sheet or a database and link it into the paper. You can give it a Google account, Google Drive account or some server which is from your university. For example, IIT Bombay, I can put it in a cloud and then give the data. So the point here is those information are put inside the publication. So you need to go in and collect the data from the papers. So please skim through the paper. At the end, it will say acknowledgement and also supplement data or data in appendix. So then you can go back and then click and take the data out. Also, they will have data in between the lines, as I told you. For example, there is a rainfall is 655 millimeters and or they will also put it as a table. In the table, you have data. You can take out the data, but very sure you should always acknowledge it. I will come to that point. So you can collect data like discharge, rainfall, most importantly the boundaries. The boundary of this watershed is not available online. You cannot get it online. So you could actually get it from the paper or email the author. Every paper will have the details of the author in the paper. It is called corresponding author. So look for the word corresponding author. You can email them and give the data. So then you could use that data to create these maps by yourself, which is the watershed is a very important aspect in the water management, I said. And then you could collect all the rainfall data, other data that is inside the paper. You could also write to the authors asking them about these data sets. So you can use averages and trends that they talk about in the data. For example, they would say the average rainfall is 655 millimeters per year and the 20-year trend shows that there is a slight increase in rainfall of 10 millimeters per year. So what you could say 20 years every year you have a 10 millimeter increase or is it after 20 years a 10 millimeter increase. You can get these information from the paper so that for your area, if there is a 20 millimeter increase per year and there is a 10-year difference, you can easily calculate how much rainfall has increased in your region. So you can kind of extrapolate or take the data from the previous data. Email the author. As I said, you can write to authors. I have a lot of requests from my papers to say, can you please share the data? Most of the authors share the data. For example, I share the data. However, if I am working on the paper again and I need to save the data until I write my papers, I will not give it. So that is one caveat. So if the author says, sorry, I cannot give it, please understand. They may be writing a paper or it may be sensitive data to things. The last thing is they'll be happy to share the data and work with you on these water budgets. So you can also ask them if they could work with you. Most importantly, from publications, please cite them. Citations is mandatory because they've put some time, collected the data and put it on a publication. All they would like to see is, if you're collecting data without asking the author, if you're collecting from the papers, then you have to give them credit. Let's see, for example, this is the watershed Dharthas groundwater level. And you would see from Jan 94 to Jan 14, the data has been collected. So what you should do is you should tell that, okay, in Jan 2002, I have this groundwater level. I take that water level and I can understand that the water level is declining and then it's stabilized. See, all these trends you can make. From here, you can easily say that Jan 2002 is around 478. It's average you can take just from the figure. So this is how I'm saying you could take the data or create an understanding about the water from publications. If you're going to use it in your thesis, in your work, in your publication or report, please cite them. You have to give citations. You can Google and see what is citation and how to cite. Basically, you have to write the author's name in your report. For example, if you're using this work, you would say, Marvi, you have to cite. Maybe you can put the whole website saying that thanks to Marvi, Marvi.org.in for the paper, for the paper which had the data. And that data you have used in your research. Then I will go to rural water data from NGOs. This is very similar to the publications, but these are books and papers that you can find not in Google Scholar, but in other, just a very simple Google search or in the agency's webpage, like here in me, here atree, you can go there and collect the data. So what do they have is project reports. As I said, every project will have a report in an NGO. Let's take this example. Water management in the Noyan Basin, situation analysis. I was part of this study. And what we did is we collected a lot of data from the government. So inside that, you see a lot of curves, trends, graphs from the government data. And physically, I collected a lot of data for groundwater, dam, tank levels, etc. Those data are also mapped inside the paper. So you could ask the atree office to share the data if they are willing, or you could take from the graph, you could understand the trends and write it in your report. So that is where you have project reports which have the data and your annual reports from NGOs, which they submit annually to the government or to the funding agency, will have all these data as a supplement. Mostly NGO data are available because they would like people to use it, unlike academics, because academics papers, they write a lot and they have to save the data until the papers are published. Whereas these type of NGOs, they promote more data sharing and they actually give you the data on request. Then they have a lot of training events. For example, in EME, they do a lot of groundwater trainings, surface water training. And for the training, they give some data and also take you to the field and collect data. All these data are stored in the database webpage. You can click and access it. Then there are monographs and white papers from the NGOs, which are mostly focused detailed study reports or white papers which is focused writing of a particular issue. So monographs will have a lot of data about a study area. So here if we say no yield river basin, if the lot of data is collected for that basin, then monographs are established. Mono is single and graphs. So it is more studies and data from a particular region. Whereas white papers is focusing on a particular issue. So here is where you could see that the rural water data can be mined from papers and NGOs and can be used in your research. For example, in this book for the Ganges Water Basin, they had conducted study on transmissivity hydraulic conductivity specific yield for the region. Sometimes that data is not available readily for the public. However, the data is here. You can go and take the data and use it in your models. Also the layers, how many layers of the aquifer are given in these reports. You could take it and use it. They've done conceptual models and a hydrological model setup, which you can use to understand the water budget and make water management better. So this is how you could use report data. I'm giving one more example here where you can see groundwater depletion and location of the tanks. You can see here there is a very a pallium tank and then there is an ortho-pallium reservoir. All these things can be mapped into the GIS layer and that layer is inside this book, which is the water management in the Noil Gryffindor Basin. It talks about how the water travels from the Willingery Hills through the villages and cities and then goes out. Throughout the lot of data, as I said, we collected. I collected groundwater data, tank data and it is in the book. You could refer the book to study about that region. Similarly, there are other reports for various regions nationally and internationally that you could collect for your work. I hope I clarified how you could mine data using Google, using the books, using the publications. Please don't say that there is not enough data. That may be the case, but you can also write to these people as email and get the data for your work. So with this, I would conclude today's lecture. I would see you in the next lecture when we discuss more on databases at state level. Thank you.