| 1. | datasets |
gathering key data about medical practices 2025-12-26 12:08:23 EST |
| 2. | datasets |
Struggling to extract data from 1,500+ mixed scanned/digital PDFs. Tesseract, OCR, and Vision LLMs all failing. Need advice. 2025-12-26 11:43:42 EST |
| 3. | datasets |
How do you efficiently pre-filter and group WhatsApp numbers to boost engagement? 2025-12-26 05:45:53 EST |
| 4. | datasets |
I made a website that showcases the 311 requests dataset 2025-12-25 19:40:55 EST |
| 5. | datasets |
Has anyone tried letting AI agents access your data and pay per request? 2025-12-25 19:35:08 EST |
| 6. | datasets |
Dataset of 5k high-quality trivia questions pulled from open trivia 2025-12-25 16:57:06 EST |
| 7. | datasets |
Historical Canadian Infectious Disease Data 2025-12-25 05:48:44 EST |
| 8. | datasets |
Unpopular opinion: If it's on the public web, it's scrapeable. Change my mind. 2025-12-24 21:09:11 EST |
| 9. | datasets |
Tomato leaf dataset containing environmental conditions such as different humidity and lightning factors 2025-12-24 19:01:13 EST |
| 10. | datasets |
Looking for Wheat Yellow Rust Image Datasets for ML Project (with Metadata) 2025-12-24 14:14:50 EST |
| 11. | datasets |
Does a corpus of archaic English words exist? 2025-12-24 13:02:11 EST |
| 12. | datasets |
Looking for a long-term collaborator – Data Engineer / Backend Engineer (Automotive data) 2025-12-23 12:14:37 EST |
| 13. | datasets |
What packaging and terms make a dataset truly "enterprise-friendly"? 2025-12-23 09:33:15 EST |
| 14. | datasets |
For large web‑scraped datasets in 2025 – are you team Pandas or Polars? 2025-12-23 06:04:44 EST |
| 15. | datasets |
Update to this: In the google drive there are currently two csv files in the top folder. One is the raw dataset. The other is a dataset that has been deduplicated. Right now, I am running a script that tries to repair the OCR noise and mistakes. That will also be uploaded as a unique dataset. 2025-12-23 05:57:17 EST |
| 16. | datasets |
Looking for dataset for AI interview / behavioral analysis (Johari Window) 2025-12-23 05:38:33 EST |
| 17. | datasets |
ScrapeGraphAI 100k: 100,000 Real-World Structured LLM Output Examples from Production Usage 2025-12-23 03:17:40 EST |
| 18. | datasets |
Football (Soccer) data - Players (without game analysis) 2025-12-22 14:02:26 EST |
| 19. | datasets |
Looking to make video game datasets by reading game memory. 2025-12-21 23:01:58 EST |
| 20. | datasets |
Identifying high growth github repositories 2025-12-21 08:42:23 EST |
| 21. | datasets |
I’m trying to "Moneyball" US High Schools to see which ones are actually D1 athlete factories. Is there a clean dataset for this? 2025-12-21 07:46:26 EST |
| 22. | datasets |
[Project] FULL_EPSTEIN_INDEX: A unified archive of House Oversight, FBI, DOJ releases 2025-12-21 04:33:28 EST |
| 23. | datasets |
Help me figure out what to do with this massive Israeli car data file I stumbled upon 2025-12-20 19:02:17 EST |
| 24. | datasets |
IPL 2025 DATASET on #kaggle via @KaggleDatasets 2025-12-20 06:13:42 EST |
| 25. | datasets |
favorite article ive read in a long time 2025-12-18 19:51:27 EST |
| 26. | datasets |
Weekly Pricing Snapshots for 500+ Online Brands (Free, MIT Licensed) 2025-12-18 15:51:48 EST |
| 27. | datasets |
Esports DFS dataset: CS2 match stats + player game logs + prop outcomes (hit/miss) 2025-12-18 12:05:08 EST |
| 28. | datasets |
How does your organization find outsourcing vendors for data labeling? 2025-12-18 09:03:29 EST |
| 29. | datasets |
Embeddings for the Wikipedia link graph 2025-12-18 06:42:30 EST |
| 30. | datasets |
DataSetIQ Python Library - Millions of datasets in Pandas 2025-12-17 23:18:14 EST |
| 31. | datasets |
SEC Filing Word Counts 1993-2000 Dataset [GitHub] 2025-12-17 14:55:51 EST |
| 32. | datasets |
Speed runs of games on twitch archive.org backup 2025-12-17 12:00:13 EST |
| 33. | datasets |
Need an unclean dataset for a special ML project 2025-12-17 08:27:44 EST |
| 34. | datasets |
Can anyone help me find Yahoo! Music User Ratings dataset R2 (also known as R2-Yahoo! Music) ? 2025-12-16 14:58:29 EST |
| 35. | datasets |
Sales analysis yearly report- help a newbie 2025-12-16 10:56:09 EST |
| 36. | datasets |
Winter Heating Costs by State: Where Home Heating Will Cost More in 2025–2026 2025-12-16 10:24:17 EST |
| 37. | datasets |
Need help ds students🙏 i wanna do a small project in ds if u have any unique ideas pls do share 2025-12-16 08:42:38 EST |
| 38. | datasets |
[Dataset] Multi-Asset Market Signals Dataset for ML (leakage-safe, research-grade) 2025-12-15 20:12:08 EST |
| 39. | datasets |
Github Top Projects from 2013 to 2025 (423,098 entries) 2025-12-15 12:53:23 EST |
| 40. | datasets |
KashRock API is in Public Beta — normalized player props + DFS + esports + odds (looking for testers) 2025-12-15 09:55:32 EST |
| 41. | datasets |
Synthetic dataset for emotion detection tasks 2025-12-15 09:15:53 EST |
| 42. | datasets |
Football Manager 2023 Players Dataset 2025-12-15 09:07:13 EST |
| 43. | datasets |
How do I scrape data from a subreddit? 2025-12-15 06:20:22 EST |
| 44. | datasets |
Any recs for solid data analysis tools that don't leak my info? 2025-12-15 03:28:10 EST |
| 45. | datasets |
A common question: What are the most time-consuming steps when you're doing data analysis? What moments during data processing make you feel the most "mentally exhausted"? 2025-12-14 22:46:12 EST |
| 46. | datasets |
How do you decide when a messy dataset is “good enough” to start modeling? 2025-12-14 22:22:04 EST |
| 47. | datasets |
Daily birth statistic from USA and England & Wales 2025-12-14 17:21:42 EST |
| 48. | datasets |
Seeking tips for a paid dataset of Twitter (X) high-follower count contact info / emails 2025-12-13 13:12:58 EST |
| 49. | datasets |
i done mt first project Spotify trends and popularity analysis 2025-12-13 11:27:23 EST |
| 50. | datasets |
Request for CRSP & Compustat data on WRDS 2025-12-13 06:01:17 EST |
| 51. | datasets |
I structured the entire Digimon evolution web into a clean JSON API. 2025-12-12 14:31:19 EST |
| 52. | datasets |
Synthetic dataset for chatbot Intent Detection tasks 2025-12-12 11:19:33 EST |
| 53. | datasets |
Full 2026 World Cup Match Schedule (CSV, SQLite) 2025-12-11 15:14:56 EST |
| 54. | datasets |
High dimensional dataset: any ideas? 2025-12-11 04:56:41 EST |
| 55. | datasets |
TrumpTracker. 2005 actions tracked and categorised 2025-12-11 04:48:46 EST |
| 56. | datasets |
Large-scale image dataset of perceptual hashing? 2025-12-10 08:26:15 EST |
| 57. | datasets |
image dataset for deepfake detection 2025-12-10 08:25:42 EST |
| 58. | datasets |
[HIRING] $20-30/hr, First-person video recording of work tasks and household tasks (10-20 hr/wk, remote) 2025-12-10 05:17:10 EST |
| 59. | datasets |
Football match datasets – Specification of event times for each match in a given competition 2025-12-09 16:32:33 EST |
| 60. | datasets |
I scraped 200k+ reviews from Mercado Livre. Here is the dataset for your NLP projects. 2025-12-09 13:52:31 EST |
| 61. | datasets |
How Google Maps quietly allocates survival across London’s restaurants - and how I built a dashboard to see through it 2025-12-09 07:19:31 EST |
| 62. | datasets |
Need Community Help - Creation of a Custom Dataset 2025-12-09 04:35:55 EST |
| 63. | datasets |
Is the site down? https://archive.ics.uci.edu/ 2025-12-08 23:04:50 EST |
| 64. | datasets |
What's the best way to get a Music Dataset? 2025-12-08 21:21:43 EST |
| 65. | datasets |
Does anyone have a list/spreadsheet of every ski resort in the world and its founding date? 2025-12-08 16:43:10 EST |
| 66. | datasets |
Scientists just released a map of all 2.75 billion buildings on Earth, in 3D 2025-12-08 16:30:30 EST |
| 67. | datasets |
Seeking B2B Data Vendor for State Unclaimed Property Records 2025-12-08 13:15:56 EST |
| 68. | datasets |
ICE: Immigration and Customs Enforcement Immigration and Customs Enforcement USA 2025-12-08 10:24:39 EST |
| 69. | datasets |
behindthename dataset / csvs with names origin and descriptions of lots of names 2025-12-08 09:27:22 EST |
| 70. | datasets |
Publicly available datasets with results and standings 2025-12-07 16:53:22 EST |
| 71. | datasets |
The Planetary Exploration Budget Dataset 2025-12-07 06:57:15 EST |
| 72. | datasets |
Data-Driven “Men’s Global Wellbeing Index” Project (With Domain + Dashboard + Dataset) 2025-12-07 01:37:27 EST |
| 73. | datasets |
how can i get a real time bitcoine news 2025-12-06 18:16:08 EST |
| 74. | datasets |
Is there a better model than snapshot + timestamp for temporal datasets? 2025-12-05 03:43:13 EST |
| 75. | datasets |
Conversational audio dataset from one speaker 2025-12-04 18:26:29 EST |
| 76. | datasets |
Students and the effects of social media 2025-12-04 12:27:55 EST |
| 77. | datasets |
data quality best practices + Snowflake connection for sample data 2025-12-04 05:45:32 EST |
| 78. | datasets |
Patterns in data! Is there any no-code solution? 2025-12-04 05:01:42 EST |
| 79. | datasets |
[Resource] 20,000+ Pages of U.S. House Oversight Epstein Estate Docs (OCR'd & Cleaned for RAG/Analysis) 2025-12-03 18:32:15 EST |
| 80. | datasets |
Hello, I am in the need for 'big' dataset. 2025-12-03 10:08:45 EST |
| 81. | datasets |
Downloading select files / Avoiding downloading entire datasets 2025-12-03 09:05:34 EST |
| 82. | datasets |
We built a database of 290,000 English medieval soldiers – here’s what it reveals 2025-12-03 03:46:35 EST |
| 83. | datasets |
Are there any open access Crop Row datasets like CRBD? 2025-12-03 01:13:36 EST |
| 84. | datasets |
Need ideas for utilizing gcp's $300 free credits in the next three days and get the most long term value out of it (something that stays even after the credits expire) 2025-12-02 17:37:52 EST |
| 85. | datasets |
Benchmarked TabPFN on 1M-10M row datasets 2025-12-02 12:54:59 EST |
| 86. | datasets |
Guidance on beginning a Data project on Matcha and its rise 2025-12-02 10:10:12 EST |
| 87. | datasets |
Looking for science education data sets 2025-12-02 08:38:26 EST |
| 88. | datasets |
96 million iNaturalist research-grade plant records dataset (free and open source) 2025-12-02 00:38:15 EST |
| 89. | datasets |
TagPilot - image dataset preparation tool 2025-12-01 16:27:06 EST |
| 90. | datasets |
Synthetic HTTP Requests Dataset for AI WAF Training 2025-12-01 15:30:57 EST |
| 91. | datasets |
I Asked an AI to “Generate a Poor Family” 5,000 Times. It Mostly Gave Me South Asians. 2025-12-01 09:45:38 EST |
| 92. | datasets |
Tiktok Trending Hashtags Dataset (2022-2025) 2025-12-01 09:23:37 EST |
| 93. | datasets |
Can you actually make money building and running a digital-content e-commerce platform from scratch? "I Will not promote" 2025-12-01 05:44:31 EST |
| 94. | datasets |
Is there a reproducible way to quantify dataset drift over time? 2025-12-01 01:47:11 EST |
| 95. | datasets |
Zillow removes data on risk of homes to disasters. Did anyone scrape it in advance? 2025-11-30 18:38:17 EST |
| 96. | datasets |
Data Share Platform (A platform where you can share data, targeted more towards IT people) 2025-11-30 13:56:13 EST |
© Copyright hackingai.app 2022-2023 Release Version 1.01202023
Return to top ►