Product Overview

From a CSV to the most complete product profile on the internet

See how Central’s 7-phase AI pipeline transforms sparse data into verified intelligence — confidence-scored, anti-hallucination checked, and channel-ready.

The Pipeline

7 phases from raw CSV to verified intelligence

Each phase transforms your product data incrementally. What enters as a sparse row exits as a complete, confidence-scored, channel-ready intelligence profile.

1

Phase 1: Import

Enters

A CSV with titles, SKUs, prices — maybe 5-10 fields

Exits

Structured product records with auto-detected categories. Your data scored at 1.0 confidence.

2

Phase 2: Field Suggestion

Enters

Category-assigned products with minimal fields

Exits

A schema of 50-129 category-specific fields that should exist — weight, noise level, certifications, materials, dimensions. The system knows what's missing.

3

Phase 3: Web Scraping

Enters

Product identifiers (name, EAN, brand)

Exits

10-20 web sources scraped per product — manufacturer sites, retailers, review sites, spec databases. Raw HTML stored for extraction.

4

Phase 4: Field Discovery

Enters

Scraped web pages with unstructured content

Exits

Additional fields discovered from real-world sources that weren't in the original schema. The web reveals what matters.

5

Phase 5: Extraction

Enters

Raw web pages + comprehensive field schema

Exits

Structured field values extracted from every source. Each value tagged with its source URL and extraction confidence.

6

Phase 6: Consolidation (Truth Engine)

Enters

Multiple values per field from multiple sources — often disagreeing

Exits

One canonical value per field, confidence-scored. Multi-source consensus. Disagreements resolved by evidence weight. The Truth Engine.

7

Phase 7: Optimization

Enters

Complete, validated product intelligence profiles

Exits

Channel-ready content: Google Shopping titles, Amazon keywords, meta descriptions, Schema.org markup, Smart Negatives, Living FAQ, contextual specs. Anti-hallucination checked.

Confidence Scoring

A credit score for every fact

Every field in every product profile carries a confidence score from 0.0 to 1.0. Not all data is created equal — and the system knows the difference.

1.0
Brand-owned data
Your own import data. Always trusted. The gold standard.
0.97
5+ independent sources agree
Near-certainty. Multiple independent sources confirming the same value.
0.88
3-4 sources agree
High confidence. Strong consensus across multiple web sources.
0.82
2 sources agree (display threshold)
The minimum for display. Below this, the system stays silent.
0.52
Single source only
Stored but never shown. Silence is better than fiction.

Confidence Hierarchy

Weight: 1,640g 0.97 · 4 sources
Noise Level: 84 dB(A) 0.88 · 3 sources
Ventilation: 5+2 0.82 · 2 sources
Liner: Coolmax 0.52 · 1 source

Below threshold — stored but not displayed

Anti-Hallucination

Every claim checked against 3 source layers

Writing is cheap. Truth is expensive. Every AI-generated claim passes through the Anti-Hallucination Validator, which cross-references against import data, scraped data, and enriched data.

Layer 1: Import Data

Your original data — always scored 1.0. The foundation of truth.

Layer 2: Scraped Data

10-20 web sources per product. Raw, independent observations from across the internet.

Layer 3: Enriched Data

Consolidated, confidence-scored intelligence. Multi-source validated values.

6 violation types detected and blocked

Fabricated Specifications

AI invents a spec that exists in no source. Blocked.

"SNELL certified" — not found in any of 14 sources.

Inflated Measurements

AI exaggerates a numeric value beyond any source. Blocked.

"Battery lasts 72 hours" — best source says 48 hours.

False Certifications

AI claims a certification the product doesn't have. Blocked.

"IP68 waterproof" — product is IP54 rated.

Invented Comparisons

AI makes competitive claims without data basis. Blocked.

"Best in class" — no comparative data exists.

Hallucinated Features

AI adds features that don't exist on the product. Blocked.

"Bluetooth 5.3" — product has no Bluetooth.

Misleading Context

AI provides technically true but misleading framing. Blocked.

"Lightweight at 2.1kg" — heaviest in its category.

The Transformation

What comes out the other side

A sparse CSV row becomes a verified, confidence-scored, channel-ready intelligence profile — automatically.

What goes in

Title Motorcycle Helmet Premium
Price €549.00
EAN 4017765145231
Brand Schuberth
Description Premium materials. High quality finish.

5 fields · 0 validated · No competitive context

What comes out

Weight 0.97

1,640g — lighter than 72%

Noise Level 0.88

84 dB(A) — quieter than 68%

Certification 1.0

ECE 22.06

Smart Negative

Not for track racing — no SNELL/FIM

FAQ Entries

87 product-specific Q&As

87 fields · 67.6% multi-source validated · Channel-perfect

See the pipeline in action with your products

In 30 minutes, we’ll show you your products enriched, your data quality score, and what your customers are missing.