 Hello, my name is Megan Raisley and I'm a notebook coordinator with the Environmental Enforcement Watch Project of the Environmental Data and Governance Initiative. I'm going to walk you through our watershed notebook, which is a data science tool that allows you to see what facilities are polluting under the Clean Water Act and watersheds that are interesting and relevant to you. So I'm going to go ahead and share my screen here. Okay, awesome. So a notebook is shorthand for a Jupiter notebook, which is a way to write and share Python code. These notebooks are accessible on a platform known as Google Collab, which allows you to access and visualize data without actually needing to do any coding. You can access this notebook to help provide and contextualize data for watershed groups across the United States. It allows you to select a geographic area you probably know for your area of interest, a zip code, and find the USGS watershed boundaries known as hydrologic unit codes or HUP codes that intersects with the zip code. The rest of the notebook then gathers information about facilities that report under the Clean Water Act and the watershed. You can use data from the EPA's Environmental Compliance and History Online, or ECHO, database. EDGE keeps our own copy of this database that is updated every two weeks. To use this notebook, you proceed through the steps, clicking the small play button that appears in the top left corner under each step, and then make choices about what you want to see. I'm just walking you through this notebook that I ran right before making this video today. So first you're going to see this introduction and instructions, which I sort of walk through already. And there's some great additional links to more information here, and I encourage you to read through it, and when you run this yourself. So on step one, just begin, we're simply going to load some code to get started. So to do that we're going to go to the upper left corner of the cell and click this play button. And what you'll see at the end of this is the word done printed right here so we know that it's worked we've loaded in some code that we needed the same goes for step two. We're just loading some code to help us so we're going to hit this button, and then when it's done, we're good to go. So step three is where you get to start making some choices we have the opportunity to type in one or more zip codes we're interested. So I run this button, this drop down is going to this not this drop down this little box is going to appear and I can type in whatever zip code I want. In this case, I've chosen zip code 14303, which is the Niagara Falls area. If you want to include additional zip code do do it in this format with commas between. So go ahead and run part B of three, which provides more context about hydrologic unit codes which I went through earlier. This is the way the USGS identifies watersheds and they're defined at different scales so this little graphic here from USGS helps better define the different scales, the number after the code. So two, four, six, eight, 10 or 12 indicates the number of digits in that particular number with more digits indicating a smaller watershed. So, in our notebook you can choose to run to look at Huck eight or Huck 10 watersheds that intersects with the zip code that you already chose. So Huck eight describes sub basins of which they're approximately 2200 in the United States, and then Huck 10 is a slightly smaller region with approximately 20,000 across the United States. Here I went ahead and chose Huck 10 for my drop down because I'm interested in a little bit of a smaller area close to my zip code. So again, now that we've chosen a zip code and we've chosen a watershed scale we're going to go ahead and run step four, which just go ahead and collect the data that we need from our copy of the echo database. And after you run this step you're not going to see a drop down or data or anything you just need to see that it's completed after pressing this play button. And then we're going to go ahead to step five, which starts to actually visualize our data. So running step five generates a map of all the facilities regulated under the Clean Water Act in our watershed. You can zoom in to see the zip code we selected as well as click on individual facility names. So this blue large boundary is the watershed that intersects with our zip code and then you see these little circles here are the facilities. And then this smaller blue boundary is the actual zip code that we selected so that's it code 14 through three. And if we zoom in here to get down to individual facilities. You can actually figure out what the names of them are so if you're interested in a particular facility or where facilities might be. This is a great place to start looking at that. So step five is also automatically generated this chart that shows the top 20 violators in this watershed for the last 13 quarters so that's the last three years plus the most recent quarter. A facility may be in violation for a quarter because they failed to submit a report violated a permit failed to take action on a violation or other reasons. The facilities that have been in violation the same number of quarters are not listed in any particular order. So if we're looking at this particular chart, Harbor Place Marine Sales isn't in more violation than 3M. For example, they've both been in violation 13 quarters. They have many more than 20 facilities that have been in violation during the last 13 quarters and they're not reported in any particular order within the echo database facilities that have been in violation for many quarters may not appear on this chart so this isn't a comprehensive chart. But it's, but it's a good place to start. So we can see here in this watershed. It looks like 17 of the top 20 facilities have been in violation the entirety of the last 13 quarters. And if you run the notebook for a place you know you might actually recognize some of the some of the facility names here, which is awesome. So step six allows you to dig further into your particular interest, and you get to make some choices here. So running the cell at six a right here generates a dropdown list to select a data set and three buttons that allow you to select what type of visual visualization you like to produce. So the data set options include clean water activations inspections penalties. 2020 discharge monitoring reports and excellent violations here I've selected clean water activations. And I've also decided to chart those violations so you can choose to map chart, map them chart them or make a table. There's a little more information on each of these different data set options up here as well as at this link. So once you've made your selection you want to go ahead and run. Part B here, which actually produces what you've asked so first it'd be really helpful if we could read this legend so this is a chart of clean water activations in our watershed. And we need to understand what this legend is telling us all these different colors are telling us. So I'm going to scroll back up here, which gives us a little more information about the legend if we're displaying a chart. So the dark blue bar represents permit schedule violations, which occur when a facility doesn't submit a report on time or other schedule violations. Here this type of violation represents the majority of violations and there's been a significant uptick in them starting around 2015. The teal bar represents the number of effluent violations reported each year. The orange is the count of compliant schedule violations, a compliant schedule violation occurs when there's a water quality target that is trying to be reached for a certain pollutant in an area. There should be a consistent uptick and these starting around 2008, where before there didn't really seem to be any at all. Finally, the pink bar of which there doesn't seem to be any on this graph is the count of single event violations. So the cv allows us to actually save the data we just visualized in Part B in a CSV format to download on our own computer. So once I've run Part C here, I can navigate to the left hand side of my screen to this little file folder, and then I'm going to open CSVs. Here I have Clean Water Act violations for the Huckton watershed for that intersects with my zip code. So I click these three little dots on the right hand side of the CSV and it'll download to my computer so you can save it for later and dig into it more and visualize it however you like. And you can also rerun Part B and C as many times as you want to look at different data sets for your watershed so I can change my selection here, change the type of visualization I want to produce, and run Part B again and if I want to save that information I can run Part C again and walk through the saving on the left side here. So then Step 7 allows us to choose a specific facility or pollutant to look at more closely. After running Part 7A here, click what you'd like to focus on which is a specific facility or a specific pollutant and you can, like Part 6 you can run this again and make a different selection later. So here I've selected to look at a particular pollutant and then I'm going to go ahead and run 7B here which generates a list of facilities and pollutants reported in this watershed. So because I selected pollutants above and this step I'm going to choose a particular pollutant to visualize and I went ahead and chose benzene and I decided to map that. I'm going to go ahead and running 7C generates the results of benzene, a map of benzene for this watershed so looking at this map, the orange circles indicate a facility that is reported discharging benzene or whatever pollutant we chose. The black dot indicates other facilities in their watershed that report under the Clean Water Act but didn't discharge benzene in 2020. And finally the size of the orange circle corresponds to the magnitude of the data for a specific facility. So for example, if a facility reports emitting 1000 pounds of benzene that would be represented by a larger circle than the facility that reports emitting 100 pounds of benzene. So if we zoom in a little more here we can see the concentration of benzene discharges occurs in three areas in this watershed this upper north area this eastern area and then the southeastern area with most of them occurring in the southeastern corner. So if we want to zoom in further, we can see who the largest emitters of benzene seem to be these largest circles and figure out what these actual facilities are and learn more about them that way. So like sub six you can change your selections and part seven a so I could choose facilities this time and choose a specific facility and visualize data about that facility. And then after seven after sub seven is the appendix, which includes some coming soon tools so there's not much you need to do here yet. And that brings us to the end of the watershed notebook walkthrough. Hope this was helpful to you. If you have any questions or would like some additional help, please don't hesitate to reach out to us at edgy. If you'd like to see more at environmental enforcement watch org, you can reach out to us at enviro dgi at gmail.com or on Twitter at EEW underscore network. Thank you so much. Bye bye.