 SCIScinet is a large-scale open data lake for the science of science research. It contains over 134 million scientific publications and millions of external linkages to funding and public uses. This data lake provides detailed documentation of preprocessing steps and analytical choices, as well as computation of frequently used measures in the literature. By providing access to these resources, SCIScinet lowers the barrier to entry, reduces duplication of efforts in data processing and measurements, improves the robustness and replicability of empirical claims, and broadens the diversity and representation of ideas in the field. This article was authored by Zihang Lin, Yein Yin, Lu Liu, and others.