Update 02_chapter02_search_engine_for_data.Rmd
odwb authored Oct 31, 2023
1 parent 4ecf69a commit aed204e
Showing 1 changed file with 6 additions and 2 deletions.
8 changes: 6 additions & 2 deletions 02_chapter02_search_engine_for_data.Rmd
@@ -4,11 +4,15 @@ output: html_document

# The features of a modern data dissemination platform {#chapter02}

In this chapter, we delineate the essential features that a contemporary online data catalog should possess to effectively serve the diverse needs and expectations of its users. Our goal is to offer suggestions for developing data catalogs that incorporate advanced search capabilities, such as lexical search, semantic search utilizing natural language processing (NLP), filtering, targeted search, and a recommender system. In defining these features, we adopt three perspectives: the perspective of data users, who represent a highly diverse community with varying needs, preferences, expectations, and capacities; the perspective of data suppliers, who publish their data or entrust a data library to do so; and the perspective of catalog administrators, who are responsible for curating and disseminating data in a responsible, effective, and efficient manner while optimizing user and supplier satisfaction.
In the introductory section of this Guide, we proposed that a data dissemination platform should be modeled after highly successful e-commerce platforms. These platforms are designed to optimally satisfy the requirements and expectations of both buyers (in our context, the data users) and sellers (in our context, the data providers who make their datasets accessible through a data catalog). In this chapter, we outline the crucial features that a modern online data catalog should incorporate to adhere to this model and effectively cater to the diverse needs and expectations of its users.

Our objective is to provide recommendations for developing data catalogs that encompass lexical search and semantic search, filtering, advanced search functionality, interactive user interfaces, and the capability to operate as a data recommender system. To define these features, we approach the topic from three distinct perspectives: the viewpoint of data users, who represent a highly diverse community with varying needs, preferences, expectations, and capabilities; the standpoint of data suppliers, who either publish their data or delegate the task to a data library; and the perspective of catalog administrators, responsible for curating and disseminating data in a responsible, effective, and efficient manner while optimizing both user and supplier satisfaction.

Creating a modern data dissemination platform is a collaborative effort that involves data curators, user experience (UX) experts, designers, search engineers, and subject matter specialists who understand both the data and the users' requirements and preferences. Users themselves should take an active part in this process, providing feedback that directly shapes the system's design.

## Features for data users

To create a positive user experience, online data catalogs must provide an intuitive and efficient interface that enables users to easily access the most relevant datasets. This requires a combination of user-friendly search tools and filters, commonly referred to as facets. To meet user expectations, we can draw inspiration from the design of successful search engines like Google or Bing, as well as e-commerce platforms such as Amazon, which prioritize simplicity, predictability, relevance, speed, and reliability. Incorporating these principles into the design of data catalogs can provide users with a seamless and user-friendly experience that mirrors the ease and convenience of popular search engines and e-commerce sites, facilitating the quick and effortless discovery and acquisition of the data they require.
In order to cultivate a favorable user experience, online data catalogs must offer an intuitive and efficient interface, allowing users to effortlessly access the most pertinent datasets. To meet user expectations effectively, one should emphasize simplicity, predictability, relevance, speed, and reliability. Integrating these principles into the design of data catalogs can deliver a seamless and user-friendly experience, akin to the convenience and ease provided by well-known internet search engines and e-commerce platforms. This, in turn, streamlines the process of discovering and obtaining the necessary data, making it quick and hassle-free for users.

### Simple search interface
