Description Search engines make archives, inventories, websites, large publishing platforms, and the internet navigable. Using the search engine we built at WordPress.com, a platform where more than 90 million blog posts and web pages are published per month, as an example, I will explain how search engines work and how their performance can be evaluated.
Abstract Search engines make archives, inventories, websites, large publishing platforms, and the internet navigable. In this talk, I will introduce the basic building blocks of a search engine.
WordPress.com is a platform where anyone can create a blog or website, and read posts that are published there and elsewhere. More than 90 million posts and pages are published on WordPress.com per month. To make it easier for our users to find the ones they are most interested in, we built a search engine that is based on Elasticsearch.
In order to keep improving our search algorithm, we need to track and analyze our user’s interactions with the search results. I will present an overview of the metrics we and others use to evaluate the performance of search algorithms.
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.