 I'm going to be doing a summary of all available biomedical data sets. There's quite a lot. Yeah, the good thing about the GPT-4 now actually providing links. I'm more familiar with the EEG ones that gave actually not know any of those, to know if they are legit or not. Let's provide the links now from where it actually took the information. So this is from a readme file from six months ago. Looks like quite a comprehensive list of EEG data sets. Yeah, IEG.org is not on the list. Yeah, so this is on the publicly available EEG data sets on OpenBCI. In this case, we have this NMT. This name comes up a lot. SculpeyG data set opensource annotated data set of healthy and pathological EEG recordings for predictive modeling. Now that's all nice and good. There's a standard montage. So that's nice. Some histograms on the population. Age and abnormal, normal EEG data set structure that looks all great. But obviously one will be looking for the link to the actual data, which must be hiding somewhere at the bottom of the article. There it is. When you open it, it sends you to this weird university website. It has some data set called repository download. Yeah, that's obviously not related. Here it is halfway through the list. I have code sitting on key tab and then you have this zip file sitting on this chatbotmart.com. And when you open this one, it goes to GoDaddy. You can buy chatbotmart.com or I don't know how much. And obviously the data is not available. So that's zip file. You know, when linking from the paper is nowhere to be found. You of course can try the a key tab. This also has link to the data set from two years ago. Data folder labels good. Doesn't seem to have the actual data. Right, this one now has a different link, which goes to the same thing with chatbotmart. Are all of them on chatbotmart? No, anyway, the links are just weird. Yeah, from all over the shop. So I'm not sure if this data set is available anywhere. So yeah, that's what happens to most of public data sets. So this lists here, I guess that only like 1% or so under 10% of these data sets will actually be available. Some will require some sort of crazy registration. And some just essentially not be available anymore. But yes, we will be essentially creating a list like that and indicating, you know, essentially some sort of difficulty score of how easy it was or how difficult it was to actually get hold of the data. And once you do get hold of the data, the main question then is if the data is in its raw format or some sort of metadata down sampled or processed over there, which we do not like.