{"id":35856,"date":"2025-10-05T21:27:27","date_gmt":"2025-10-05T19:27:27","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/looking-for-public-datasets-on-consumer-search-behavior-conversational-search-for-academic-research\/"},"modified":"2025-10-05T21:27:27","modified_gmt":"2025-10-05T19:27:27","slug":"looking-for-public-datasets-on-consumer-search-behavior-conversational-search-for-academic-research","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/looking-for-public-datasets-on-consumer-search-behavior-conversational-search-for-academic-research\/","title":{"rendered":"Looking For Public Datasets On Consumer Search Behavior &amp; Conversational Search (for Academic Research)"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>Hi everyone,<\/p>\n<p>I\u2019m currently conducting a research project comparing traditional search engines (e.g., Google) and LLM-based conversational search tools (e.g., ChatGPT, Perplexity.ai) in the context of consumer search behavior, specifically how users search for and choose products like smartphones when factors such as price and features moderate their decisions. I intend to conduct a controlled experiment to collect search behavior data from approximately
100 participants, which should provide causal evidence, but I still want to validate those insights against external datasets or benchmarks.<\/p>\n<p>I\u2019m looking for publicly available datasets that capture one or more of the following aspects:<\/p>\n<ul>\n<li>Users\u2019 background, including age, gender, education, employment, nationality, residence, and prior knowledge of AI tools and shopping-related tools.<\/li>\n<li>Search behavior logs (queries, clicks, scrolls, or multi-turn interactions).<\/li>\n<li>Conversational or query reformulation datasets, where users ask follow-up questions or clarify queries.<\/li>\n<li>Consumer choice or e-commerce data (based on price or features).<\/li>\n<li>User attitude or satisfaction survey data (e.g., perceived trust, relevance, ease of use, usefulness, overload, decision confidence, and handling of contradictory information).<\/li>\n<\/ul>\n<p>Also open to:<\/p>\n<ul>\n<li>Suggestions for benchmark datasets used in <em>Conversational Search<\/em> or <em>Retrieval-Augmented Generation (RAG)<\/em> evaluations<\/li>\n<li>References to recent arXiv or TREC publications releasing such data<\/li>\n<\/ul>\n<p>If anyone here knows of datasets that bridge search interactions, or of newer LLM-integrated conversational search datasets, I\u2019d really appreciate your input. 
Thanks in advance!<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/Dismal_Priority_2381\"> \/u\/Dismal_Priority_2381 <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1nyxh4g\/looking_for_public_datasets_on_consumer_search\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1nyxh4g\/looking_for_public_datasets_on_consumer_search\/\">[comments]<\/a><\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Hi everyone, I\u2019m currently conducting a research project comparing traditional search engines (e.g., Google) and LLM-based 
conversational&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-35856","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/35856","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=35856"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/35856\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=35856"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=35856"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=35856"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}