Text and Data Mining Policy
Last updated: December 16, 2025
1. Introduction
Universo Pet Tecnologia Ltda. ("Universo Pet", "we", "our") exercises its rights to reserve text and data mining (TDM) in accordance with the General Data Protection Law (LGPD - Law No. 13.709/2018) and the European Copyright Directive in the Digital Single Market (EU Copyright Directive - CDSM, Article 4).
This policy establishes the conditions under which third parties may access, track and process content from our website universo-pet.com for text and data mining purposes.
2. Definitions
For the purposes of this policy, the following definitions apply:
Text and Data Mining (TDM): Automated techniques for analyzing digital content to extract information, patterns, trends and knowledge, including but not limited to web scraping, crawling and training of Artificial Intelligence (AI) models.
Public Content: Pages, texts, images and other materials publicly available on universo-pet.com without requiring authentication.
Protected Content: Authenticated user data, private APIs, administrative areas and any personal information protected by LGPD.
AI Crawlers: Automated bots from AI companies that collect data for language model training (e.g., GPTBot, Google-Extended, CCBot).
Search Crawlers: Search bots that index content to provide direct answers to users (e.g., ChatGPT-User, ClaudeBot, PerplexityBot).
3. TDM Rights Reservation
In accordance with Article 4 of the EU CDSM Directive and Article 18 of LGPD, Universo Pet expressly reserves its rights over content published on its website.
This means that unauthorized use of our content for text and data mining, including training of commercial AI models, is prohibited, except when expressly permitted in this policy or through prior written authorization.
4. Permitted TDM Uses
The following uses of text and data mining are expressly permitted without requiring additional authorization:
4.1. Indexing for Search and Answers
We allow search engines and AI assistants to index our public content to provide direct answers to end users (search/inference), provided that:
They respect the limits established in our
robots.txtfileThey cite the source as "Universo Pet (universo-pet.com)" when using our content in answers
They include our critical disclaimers (e.g., "Dr. Paws does not replace licensed veterinarian")
They do not access protected content or private areas
Crawlers allowed for search/inference:
ChatGPT-User, ChatGPT-User/2.0, OAI-SearchBot
ClaudeBot, Claude-SearchBot
PerplexityBot
Googlebot, Bingbot (traditional engines)
4.2. Academic and Scientific Research
We allow the use of our public content for non-commercial academic research, provided that:
The research is conducted by recognized institutions
Results are published in an open and transparent manner
Universo Pet is cited as a source in published works
There is no commercial use of collected data or trained models
5. Prohibited TDM Uses
The following uses of text and data mining are expressly prohibited:
5.1. Commercial AI Model Training
The use of our content for training commercial Artificial Intelligence models is prohibited without prior written authorization. This includes:
Generative language models (LLMs)
Computer vision models (image recognition)
Multimodal models (text + image + audio)
Any AI system that uses our content as training data
Crawlers blocked for AI training:
GPTBot (OpenAI training bot)
Google-Extended (Google AI training)
CCBot (Common Crawl)
Anthropic-AI, cohere-ai, Omgilibot, FacebookBot (when used for training)
5.2. Access to Protected Content
The following is strictly prohibited:
Accessing areas that require authentication (dashboard, user profiles, veterinary consultations)
Collecting personal user data (names, emails, pet medical history)
Scraping private APIs (/api/*)
Bypassing technical protection measures (rate limiting, CAPTCHAs, robots.txt)
5.3. Malicious or Competitive Use
The use of TDM for the following is prohibited:
Creating competing products or services based on our content
Replicating our veterinary knowledge base in other systems
Distributing datasets or text corpora that include our content
Using our content to train competing veterinary virtual assistants
6. Technical Implementation of TDM Policy
This policy is technically enforced through the following mechanisms:
6.1. TDM Reservation Protocol (TDMRep)
We provide a JSON file at /.well-known/tdmrep.json in accordance with the TDM Reservation Protocol, which formally declares our rights reservation and specifies allowed/protected scopes.
6.2. Robots.txt
Our /robots.txt file contains specific directives for each user-agent, allowing search crawlers and blocking training crawlers.
6.3. HTML Meta Tags
Protected pages include noindex, nofollow and noarchive meta tags when applicable.
6.4. Rate Limiting and DDoS Protection
We implement rate limiting and protection against abusive scraping that may impact service availability.
7. TDM Authorization Requests
Organizations wishing to use our content for text and data mining purposes beyond permitted uses must request prior written authorization.
7.1. How to Request Authorization
Send an email to studio.kodaai@gmail.com with the following information:
Organization identification: Name, registration number, address, website
TDM purpose: Detailed description of the project, AI model or research
Content scope: Which pages/data will be collected
Collection period: When TDM will be performed
Commercial use: Whether the result will be used commercially or in paid products
Privacy guarantees: How personal data (if any) will be protected in accordance with LGPD
7.2. Evaluation Criteria
We will evaluate requests based on the following criteria:
Social or scientific benefit of the project
Transparency about the final use of collected data
Compliance with LGPD and applicable legislation
Commitment to proper citation and attribution to Universo Pet
Potential impact on our infrastructure and users
8. Violations and Penalties
Non-compliance with this TDM policy constitutes a violation of our Terms of Use and may result in:
Immediate technical blocking: IPs and user-agents will be permanently blocked
Provider notification: We will inform companies responsible for violating crawlers
Legal action: We reserve the right to take legal measures in accordance with LGPD and copyright legislation
Publication of violators: We may publicly disclose organizations that violate this policy
9. LGPD Compliance
This TDM policy is aligned with the General Data Protection Law (LGPD - Law No. 13.709/2018), especially:
Art. 18, §7: Right of the data subject to object to processing based on consent exemption grounds
Art. 7, IX: Legitimate interest as a legal basis (our interest in protecting intellectual property)
Art. 10: Processing of sensitive personal data (pet medical history) requires explicit consent
If third parties perform TDM on our content, they are responsible for ensuring that any personal data collected is processed in accordance with LGPD.
10. Policy Updates
We reserve the right to update this TDM policy periodically to reflect technological, legal or business model changes.
Significant changes will be communicated through:
Update of the "Last updated" date at the top of this page
Modification of the
"modified"field in the tdmrep.json fileNotification on our blog or official channels (when applicable)
11. Contact
For questions, authorization requests or notifications related to this TDM policy, please contact:
Email: studio.kodaai@gmail.com
Website: https://universo-pet.com
Company Name: Universo Pet Tecnologia Ltda.
Address: Esplanada, Bahia, Brasil
12. Legal Basis
This policy is based on:
LGPD (Law No. 13.709/2018): General Data Protection Law of Brazil
EU CDSM Directive (Article 4): European Directive on Copyright in the Digital Single Market
Copyright Law (Law No. 9.610/1998): Protection of intellectual works in Brazil
Marco Civil da Internet (Law No. 12.965/2014): Rights and duties for Internet use in Brazil
© 2025 Universo Pet Tecnologia Ltda. All rights reserved.
TDM Reservation Protocol: /.well-known/tdmrep.json