#Datasets

Latest posts tagged with #Datasets on Bluesky

Croissant - MLCommons
The MLCommons Croissant working group standardizes how ML datasets are described to make them easily discoverable and usable across tools and platforms.

Croissant - MLCommons #data #datasets #ai #ml #machineLearning #medlibs
mlcommons.org/working-grou...

Netnut – Proxy, Reviews, Pros & Cons, Alternatives | Caproxy

Netnut is a popular proxy provider that has been operating since 2017, with its headquarters in Israel. It is known for its high-quality proxies and is primarily focused on corporate clients.

caproxy.com/en/list/netn...

#netnut #proxy #proxies #caproxy #datasets #scrapers

The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric

Hannes Kath, Thiago S. Gouvêa, Daniel Sonntag

Action editor: Kamalika Chaudhuri

https://openreview.net/forum?id=q6hRb6fETo

#performance #iterative #datasets

Examining reference data in the StatsWales publishing service - Register Dynamics, data focused technology consultancy based in the UK
Recently we have undertaken an examination of the potential for reference data within the StatsWales publishing service. This is a service provided by the Welsh Government where data from public sector organisations can be published for use by interested parties. It comprises over a thousand dat...

💡Public sector data is powerful—but only if consistent and comparable.
We examined #StatsWales and found common dimensions like geography, age, or ethnicity are handled inconsistently across #datasets. Explore what we did:
www.register-dynamics.co.uk/blog/examini... 

#ReferenceData #OpenData

Hello, I'm looking for #datasets of school catchment maps (carte scolaire) for primary and elementary schools. I've combed through everything I could find here and there, without success.
Do you know of any datasets of this kind that could be used for mapping (#carto)? #dev #carto

GitHub - VLa-Labs/Norwegian-Language-Dataset-List: A curated collection of 33 public Norwegian language datasets metadata.

Working on 🇳🇴 Norwegian NLP?

Here’s a curated collection of 33 Norwegian language datasets, with dataset links and original paper references. A practical entry point to the Norwegian NLP / language technology landscape!

📌 Link: github.com/VLa-Labs/Nor...

#Norwegian #NorwegianNLP #NLP #Datasets #ML

Are Time-Indexed Foundation Models the Future of Time Series Imputation?

Etienne Le Naour, Tahar Nabil, Adrien Petralia, Ghislain Agoua

Action editor: Jes Frellsen

https://openreview.net/forum?id=cTk56KpsP5

#imputation #tsfm_imputation #datasets

I'm a beginner documenting my data journey and this is the list I wish I had from day one. Full article link in the comments. #DataAnalytics #DataJourney #Lifelonglearner #Blackwomenintech #Medium #Free #Datasets

Finally Outshining the Random Baseline: A Simple and Effective Solution for Active Learning in 3D...

Carsten T. Lüth, Jeremias Traub, Kim-Celine Kahl et al.

Action editor: Jose Dolz

https://openreview.net/forum?id=UamXueEaYW

#dataset #datasets #segmentation

Research Paper (preprint) "Linking Global #Science #Funding to Research #Publications" arxiv.org/pdf/2603.24147 #publications #scholcomm #datasets #data #funders

A dataset of insect sounds from 459 species for bioacoustic machine learning - Scientific Data

New paper from us: "A dataset of insect sounds from 459 species for bioacoustic machine learning", published in Scientific Data, led by Marius Faiß https://doi.org/10.1038/s41597-026-07123-4 #bioacoustics #datasets

NIST Helps Fingerprint Examiners With New Data and Software Release
The new tools are an annotated collection of 10,000 fingerprints and a software program that can sort fingerprints according to their quality.

#crime #forensics #datasets #fingerprints #NIST #AI

'A NIST collection of 10,000 fingerprints has now been fully annotated with details that will help train both human fingerprint examiners and AI tools.'

www.nist.gov/news-events/...

Theoretically Understanding Data Reconstruction Leakage in Federated Learning

Binghui Zhang, Zifan Wang, Meng Pang, Yuan Hong, Binghui Wang

Action editor: Jinghui Chen

https://openreview.net/forum?id=1UfDXeYxwk

#federated #privacy #datasets

First versions of harmonised data now available | Infra4NextGen
As part of the Infra4NextGen project, harmonised datasets for each of the five themes have been published for the first time on the NextGen Harmonised Data Gateway. These initial versions include…

Harmonised #datasets for the five themes of the NextGenerationEU recovery plan are now available for download.

These files include #data from five major #surveys that has been #harmonised to make it as comparable as possible, even if the #question text and response scales differed.

New #J2C Certification:

Reasoning-Driven Synthetic Data Generation and Evaluation

Tim R. Davidson, Benoit Seguin, Enrico Bacis, Cesar Ilharco, Hamza Harkous

https://openreview.net/forum?id=NALsdGEPhB

#generate #annotators #datasets

⛰️🌍 Mountains are underrepresented in global #datasets, yet are critical for understanding #ClimateChange & its impacts.

Strengthening #observations in #OurChangingMountains is key. 🗝️

MRI contributed this perspective at last month's Global Climate Observing System #GCOS meeting.

📖👉️ buff.ly/3JMiBjv

★★★★★ Private B2B data broker
Verified business & consumer intelligence datasets: executive contacts, firmographics, and market insights. Transparent pricing, no quotes required.

Business & Consumer Intelligence You Won’t Find Anywhere Else

Structured datasets on companies, executives, consumers, and behavioral signals—ready for research, analysis, segmentation, or integration into your workflows.

mediumaxis.com

#datasets #intelligence #leadgeneration

DataSeer develops AI system to track dataset reuse - Research Information
Tool has ability to "systematically track data reuse giving a new lens on openness, research integrity, and downstream impact"

DataSeer develops AI system to track dataset reuse: www.researchinformation.info/news/datasee...

#Data #LLM #LargeLanguageModel #LLM #OpenScience #OpenAccess #OA #Datasets #Stratos #AI #ArtificialIntelligence #ResearchData #DataSeer #Grants #MJFF

On the Importance of Pretraining Data Alignment for Atomic Property Prediction

Yasir M. Ghunaim, Hasan Abed Al Kader Hammoud, Bernard Ghanem

Action editor: Changyou Chen

https://openreview.net/forum?id=jfD9BsrDTb

#dataset #datasets #inception

But large #datasets bring challenges:
• Bias in digital data sources
• Measurement validity issues
• Risks of overfitting models

Therefore, validation and replication are essential in CSS research.

Executive summary of the report on Spanish datasets in Zenodo

The report on #datasets from Spanish universities in #Zenodo, with data from December 2025, has now been published. More datasets, but a lower level of description. We must not let our guard down. University libraries ought to do something about it. www.javima.info/ciencia-abie...
#CienciaAbierta

Eye-Tracking-While-Reading Datasets

👀 📣 To all users of eye-tracking-while-reading datasets: check out our comprehensive, filterable dataset overview!

Dataset overview: dili-lab.github.io/datasets.html

Preprint: arxiv.org/abs/2602.19598

Add or edit your dataset: www.cl.uzh.ch/en/research-...

#FAIR #eyetracking #datasets

Scientists warn fake research is spreading faster than real science
A sweeping new study from Northwestern University reveals that scientific fraud is no longer just the work of a few rogue researchers; it has evolved into a global, organized enterprise. By analyzing…

"By analyzing massive #datasets .. #researchers uncovered networks involving “paper mills,” brokers, and compromised journals that systematically produce and sell fake #research, authorship slots, and #citations.": buff.ly/YJ4bqBU

via sciencedaily
#science #MedSky #research #ResearchJournals

Why Austria? A Prime Telemarketing Goldmine
In today's fast-paced digital landscape, businesses are constantly seeking efficient ways to connect with high-value leads. For marketers ta...

Enter 100% verified active #AustriaWhatsApp #numberdata from trusted #WhatsAppDatabase companies. These premium #datasets offer a #gamechanging solution for #telemarketing and direct call marketing #campaigns, delivering unmatched accuracy and ROI.
buywhatsappdatabase247.blogspot.com/2026/03/aust...

The scryptIQ #machinelearning module covers both supervised and unsupervised learning methods: namely the classification and clustering of different #biological #datasets, including images.

scryptiq.ai

Science is more than papers

153M+ research outputs in the #OpenAIREGraph are linked to #datasets & #software
A growing web of connections allowing us to see how knowledge is built across publications, data & code, not just the final paper.
Explore connections
🔗 #GraphAPI shorturl.at/oRotk
🔗 #OpenAIRE EXPLORE shorturl.at/RIZoh

New #J2C Certification:

Probabilistic Pretraining for Improved Neural Regression

Boris N. Oreshkin, Shiv Kumar Tavker, Dmitry Efimov

https://openreview.net/forum?id=F6BTATGXaf

#datasets #tabpfn #regression

BGS' BritPits map shows the distribution of worked mineral commodities across the UK - tinyurl.com/5ydmtaf6

#Aspermont #BritishGeologicalSurvey #BritPits #MineralResources #MineralPlanningAuthority #Geology #Datasets

"From Reflection to Repair: A Scoping Review of Dataset Documentation Tools" (new preprint via arXiv) arxiv.org/abs/2602.15968 #data #datasets #rdm

Discussing AI in the sphere of geological modelling with respect to the tunnelling industry - tinyurl.com/54bxc7bs

#Aspermont #COWIfonden #UniversityofStrathclyde #TechnicalUniversityofDenmark #COWI #AI #Tunnelling #GroundInvestigation #DataSets #GeologicalModelling
