# Training & Testing Data

Training and testing are an important part of ML. To test a model, we need some data that it has not seen before. That is why we split the dataframe into **training and testing parts**.

If we trained our model on 100% of the dataframe and then tested it on the same data, the results would look far better than they really are, because the model has already seen every answer.\
If we instead hide some of the data from the model and show it only during testing, we get a more realistic picture of how it performs.
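
To make this concrete, here is a minimal sketch in plain Python. The `MemorizingModel` class, the `accuracy` helper, and the random labels are all made up for illustration: the "model" simply memorizes its training rows, which is an extreme case of what happens when we test on data the model has already seen.

```python
import random
from collections import Counter


class MemorizingModel:
    """Toy model (hypothetical): it only remembers the rows it was trained on."""

    def fit(self, X, y):
        self.lookup = dict(zip(X, y))
        # For rows it has never seen, guess the most common training label.
        self.fallback = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.lookup.get(x, self.fallback) for x in X]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up data: 200 unique feature values with random 0/1 labels.
rng = random.Random(42)
X = list(range(200))
y = [rng.randint(0, 1) for _ in X]

# Train on 100% of the data and test on the same data.
model_all = MemorizingModel().fit(X, y)
acc_same = accuracy(y, model_all.predict(X))

# Hide the last 20% during training and use it only for testing.
X_train, X_test = X[:160], X[160:]
y_train, y_test = y[:160], y[160:]
model_split = MemorizingModel().fit(X_train, y_train)
acc_unseen = accuracy(y_test, model_split.predict(X_test))

print("tested on training data:", acc_same)
print("tested on unseen data:  ", acc_unseen)
```

The first score is perfect only because the model is being asked about rows it memorized. The second score is the one that tells us how the model might behave on new data.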

## Methods for Splitting Data

There are multiple ways to split our data into training and testing sets:

* Train-Test split
* Cross Validation


