Privacy

A Practical Guide to data privacy issues

Flowsery Team
Flowsery Team
3 min read

TL;DR — Quick Answer

3 min read

Google Analytics retention settings and US-based data storage create compounding GDPR compliance risks that organizations must actively manage or avoid entirely.

This guide explains data privacy issues in practical terms, with a focus on privacy-first analytics decisions.

Google Analytics data retention is easy to misunderstand. In GA4, retention settings affect user-level and event-level data used in Explorations and funnel reports, not standard aggregated reports. Google's documentation lists 2 months and 14 months for standard properties, with longer options for Google Analytics 360, and says data is deleted automatically when it reaches the end of the retention period (GA4 data retention).

That setting is not only an analytics convenience. It is a privacy control.

Why retention matters under GDPR

The GDPR's storage limitation principle requires personal data to be kept in identifiable form only as long as necessary for the purpose. If raw event data is kept because "we might need it someday," the purpose is too vague.

Analytics data can include personal data even when names are absent. Device data, cookie identifiers, user IDs, full URLs, IP-derived location, event parameters, and behavioral sequences can identify or single out a person. Longer retention increases breach impact, access risk, and regulatory exposure.

What GA4 retention does and does not solve

GA4 retention controls can reduce how long event-level data remains available for certain analysis features. But they do not answer every privacy question:

  • Standard aggregated reports are not affected in the same way.
  • BigQuery export creates a separate dataset under your control.
  • Linked products may have their own retention behavior.
  • Downloaded reports and warehouse copies need their own policy.
  • Consent and transfer issues still need separate analysis.

If you export GA4 data to BigQuery, Google says you own that exported data and manage access through BigQuery controls (GA4 BigQuery export). That means retention responsibility moves to you.

Risky retention patterns

Common problems include:

  • Leaving default retention without understanding reporting needs.
  • Exporting raw data to a warehouse with no deletion schedule.
  • Keeping user identifiers in analytics after account deletion.
  • Storing full page URLs that include emails, tokens, or search terms.
  • Giving broad staff access to event-level data.
  • Retaining data for advertising purposes after users opt out.

A better retention model

Use tiers:

Real-time and debugging: hours to days. Useful for deployment checks and incident investigation.

Raw event analysis: 30 to 180 days, depending on product cycles and legal basis.

Aggregated reporting: 12 to 36 months for trend analysis, with no personal identifiers.

Financial or contractual records: separate from web analytics and retained under accounting or legal obligations.

Document the purpose for each tier and automate deletion. Manual deletion policies fail quietly.

Privacy-first analytics advantage

A privacy-first analytics platform that avoids cookies, persistent IDs, fingerprinting, and raw IP storage reduces retention risk from the beginning. Aggregated metrics can often be kept longer because they are less likely to identify individuals. Raw events can be short-lived or avoided entirely.

Flowsery
Flowsery

Start Free Trial

Real-time dashboard

Goal tracking

Cookie-free tracking

The goal is not to delete useful history. It is to keep the most useful form of history: trends, conversions, campaigns, and content performance without unnecessary personal traces.

Checklist

  1. Check GA4 retention settings.
  2. Identify all exports and connected products.
  3. Define retention for raw events, reports, and warehouse tables.
  4. Remove personal data from event parameters.
  5. Restrict access to event-level data.
  6. Document deletion workflows for user requests.
  7. Review retention after major product or legal changes.

Retention is where privacy promises become real. If you cannot say why a dataset still exists, it is probably time to aggregate or delete it.

Retention for event properties

Retention review should include event properties, not only event timestamps. A property such as search_term, account_id, page_location, or checkout_step may carry more privacy risk than the event name. If you need search analytics, consider grouping queries, dropping rare queries, or reviewing terms for sensitive content before storing them.

Access controls

Short retention does not help if too many people can export data while it exists. Limit raw analytics access to people who need it, prefer aggregated dashboards for most stakeholders, and log exports from data warehouses. Analytics data often feels low-risk until it is combined with CRM, billing, or support data. Access policies should assume that joins can increase sensitivity.

Retention Policy Template

Write the policy in business language. Raw analytics events are kept for a short diagnostic window. Aggregated reports are kept longer for trend analysis. Sensitive event properties are blocked or redacted before storage. Exports require a named purpose and expire. Vendor retention settings are reviewed after product launches, campaign changes, and agency handovers. Google's own GA4 documentation on data retention shows that retention settings affect user-level and event-level data differently, so teams should not assume one toggle solves every risk.

For a privacy-first setup, separate three layers. First, real-time operational events used to verify tracking. Second, recent raw events used to debug forms, campaigns, and funnels. Third, aggregate historical metrics used for strategy. Most teams need the third layer far longer than the first two. This design keeps useful history while reducing the chance that old identifiers, URLs, search terms, or accidental personal data remain available years after the original purpose expired.

Retention Audit Actions

Build a retention inventory for GA4 and every connected destination. Record whether enhanced measurement, Google Signals, ads personalization, User-ID, BigQuery export, Consent Mode, cross-domain measurement, and region-specific settings are enabled.

Then separate what must stay raw from what can become aggregate history. Keep GA4 data only where the Google reporting or ads ecosystem has a justified job; move baseline pages, referrers, campaigns, goals, and aggregate funnels to a lower-risk setup where possible.

Was this article helpful?

Let us know what you think!

Before you go...

Flowsery

Flowsery

Revenue-first analytics for your website

Track every visitor, source, and conversion in real time. Simple, powerful, and fully GDPR compliant.

Real-time dashboard

Goal tracking

Cookie-free tracking

Related Articles