# Parser Settings

BotTalk Parser to work correctly, you must first configure it appropriately.

<figure><img src="https://bottalk.io/learn-with-bottalk/content/images/2021/08/parser-settings-bottalk-audio-cms.png" alt="Parser Settings - BotTalk Audio CMS"><figcaption><p>Parser Settings - BotTalk Audio CMS</p></figcaption></figure>

{% hint style="info" %}

1. Activate the BotTalk parser by checking the checkbox: **Enable automatic parsing of articles***.*
2. Enter your website URL.
3. Set the starting date. BotTalk Parser will convert all articles published after this date into speech. The parser will ignore all articles published before this date.
4. **Update speech on article text change***:* a news article gets updated on average 2,8 times. Please check this checkbox if you want to keep your written articles in synch with the audio editions.
   {% endhint %}

{% hint style="danger" %}
Setting the **Include articles published after** field is essential. If set incorrectly, you run into the risk of high TTS costs due to the audification of the whole archive of your website.
{% endhint %}

## Article Publication Date Format

BotTalk needs an **article publication date** to correctly sort, deduplicate, and process content. The parser can usually detect this automatically, but you can also provide it manually when needed.

### How BotTalk determines the publication date

#### 1) Automatic detection (preferred)

BotTalk parser first tries to extract the article’s publication date from **JSON-LD** embedded in the page (typically a `<script type="application/ld+json">…</script>` block).

Common JSON-LD fields that may contain a publication date include:

* `datePublished`
* `dateCreated`
* `dateModified` (used only if publication date is not available, depending on site structure)

If JSON-LD is present and contains a valid date, no additional configuration is required.

#### 2) Manual date (fallback)

If the page **does not contain JSON-LD** (or it’s missing a usable date), you must provide the article publication date **manually via extractor rules**.

Use manual date only when:

* There is **no JSON-LD** on the page, or
* JSON-LD exists but **does not include** a valid `datePublished` (or equivalent), or
* The JSON-LD date is incorrect and must be overridden.

It's recomended to use the [\<time> HTML tag](https://developer.mozilla.org/en-US/docs/Web/HTML/Reference/Elements/time). But other tags are supported as well:

<figure><img src="https://2378192786-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FirnDC5K5ndnjPJHyDBre%2Fuploads%2F10AzsxX9cDhJEo88iAkL%2Fimage.png?alt=media&#x26;token=e0b06a82-eb9a-4117-9b1b-474a3afd6937" alt=""><figcaption></figcaption></figure>

***

## Whitelisting BotTalk Parser in Firewall

Sometimes BotTalk Parser can get blocked by your firewall settings.

You will see the error message in your browser console with `403` HTTP Error code, stating that BotTalk Parser can not access your website.

In order to fix this, please make sure to whitelist ***both*** BotTalk Parser IP and HTTP Headers:

{% hint style="warning" %}
**BotTalk Parser IP:**

`138.199.162.79`
{% endhint %}

{% hint style="warning" %}
**BotTalk Parser HTTP Headers:**

`"User-Agent", "bottalk/extractor.1.1."`
{% endhint %}

## Limiting TTS Costs

Avoiding to run into the risk of high TTS costs you can set a daily character limit. The limit is set by default at 100.000 characters.&#x20;

{% hint style="danger" %}
The calculation of the Daily Character Limit **DOES include article updates**. This means, to make sure that the articles get the newest audio version after you edit them, please include a buffer of 20% into your calculation of Daily Character Limit. Otherwise BotTalk will not update the edited articles, when the Limit quota is hit.
{% endhint %}

You can view the total number of characters used in the "Cost of Audification" statistics.

To specify the limit, click on the checkbox "Daily Character Limit". Afterwards the field "Specify Limit" appears and you can set up your custom limit.&#x20;

<figure><img src="https://2378192786-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FirnDC5K5ndnjPJHyDBre%2Fuploads%2FOtrMRa89JaCBLEwCB2IN%2FBildschirmfoto%202023-03-15%20um%2014.15.18.png?alt=media&#x26;token=c4d1a0c7-5612-4fe0-9672-170f2858010a" alt=""><figcaption><p>Set up custom daily character limit</p></figcaption></figure>
