Metadata-Version: 2.3
Name: acace_text_preprocessor
Version: 0.1.1
Summary: Text preprocessing utilities for ACACE
Author: ACACE Team
Author-email: team@acace.ai
Requires-Python: >=3.8,<4.0
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Requires-Dist: nltk (>=3.8.1,<4.0.0)
Requires-Dist: numpy (>=1.26.3,<2.0.0)
Requires-Dist: regex (>=2023.12.25,<2024.0.0)
Requires-Dist: spacy (>=3.7.2,<4.0.0)
Requires-Dist: unidecode (>=1.3.7,<2.0.0)
Description-Content-Type: text/markdown

# ACACE Text Preprocessor

A text preprocessing module for the Adaptive Context-Aware Content Engine (ACACE).

## Features

- HTML tag removal and entity decoding
- Whitespace normalization
- Simple and efficient text cleaning

## Installation

```bash
pip install acace_text_preprocessor
```

## Usage

```python
from acace_text_preprocessor import preprocess_text

# Clean and normalize text
cleaned_text = preprocess_text("Your <b>HTML</b> text with   extra   spaces")
print(cleaned_text)  # Output: "Your HTML text with extra spaces"
```

## License

MIT License

