{"id":2070,"date":"2023-01-02T17:03:09","date_gmt":"2023-01-02T16:03:09","guid":{"rendered":"https:\/\/kairntech.com\/doc\/?p=2070"},"modified":"2025-07-31T15:04:16","modified_gmt":"2025-07-31T13:04:16","slug":"how-to-generate-train-and-test-metadata","status":"publish","type":"post","link":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/","title":{"rendered":"How to split a dataset (train, test)?"},"content":{"rendered":"\n<p>When your dataset is ready for Model Experiments, it&#8217;s recommended to generate train &amp; test metadata beforehands. <\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Go to the <em>Model Experiments<\/em> view<\/li>\n\n\n\n<li>Click on the split button at the top right<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"381\" src=\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-1024x381.png\" alt=\"\" class=\"wp-image-5344\" srcset=\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-1024x381.png 1024w, https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-300x112.png 300w, https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-768x286.png 768w, https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-1536x572.png 1536w, https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116.png 1920w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Results will be in &#8216;Corpus&#8217; metadata accessible in:\n<ul class=\"wp-block-list\">\n<li>the <em>Documents<\/em> view with the filter panel for Text classification project<\/li>\n\n\n\n<li>the <em>Segments <\/em>view with the filter panel for Entity detection project<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"686\" height=\"878\" src=\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-117.png\" alt=\"\" class=\"wp-image-5345\" style=\"width:479px;height:auto\" srcset=\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-117.png 686w, https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-117-234x300.png 234w\" sizes=\"auto, (max-width: 686px) 100vw, 686px\" \/><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Note that when you start a new model experiment, the split will be automatically updated if you have added new annotations in the dataset<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-embed is-type-wp-embed is-provider-kairntech-documentation wp-block-embed-kairntech-documentation\"><div class=\"wp-block-embed__wrapper\">\n<blockquote class=\"wp-embedded-content\" data-secret=\"o4JfrJO3bd\"><a href=\"https:\/\/kairntech.com\/doc\/how-to-experiment-entity-detection-with-models-advanced\/\">How to experiment entity detection with models (advanced)?<\/a><\/blockquote><iframe loading=\"lazy\" class=\"wp-embedded-content\" sandbox=\"allow-scripts\" security=\"restricted\" style=\"position: absolute; visibility: hidden;\" title=\"&#8220;How to experiment entity detection with models (advanced)?&#8221; &#8212; Kairntech Documentation\" src=\"https:\/\/kairntech.com\/doc\/how-to-experiment-entity-detection-with-models-advanced\/embed\/#?secret=3E63TEfn4y#?secret=o4JfrJO3bd\" data-secret=\"o4JfrJO3bd\" width=\"600\" height=\"338\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\"><\/iframe>\n<\/div><\/figure>\n\n\n\n<figure class=\"wp-block-embed is-type-wp-embed is-provider-kairntech-documentation wp-block-embed-kairntech-documentation\"><div class=\"wp-block-embed__wrapper\">\n<blockquote class=\"wp-embedded-content\" data-secret=\"QEzIyxcn0o\"><a href=\"https:\/\/kairntech.com\/doc\/how-to-experiment-categorization-with-models-advanced\/\">How to experiment categorization with models (advanced)?<\/a><\/blockquote><iframe loading=\"lazy\" class=\"wp-embedded-content\" sandbox=\"allow-scripts\" security=\"restricted\" style=\"position: absolute; visibility: hidden;\" title=\"&#8220;How to experiment categorization with models (advanced)?&#8221; &#8212; Kairntech Documentation\" src=\"https:\/\/kairntech.com\/doc\/how-to-experiment-categorization-with-models-advanced\/embed\/#?secret=KL64Qn97dk#?secret=QEzIyxcn0o\" data-secret=\"QEzIyxcn0o\" width=\"600\" height=\"338\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\"><\/iframe>\n<\/div><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>When your dataset is ready for Model Experiments, it&#8217;s recommended to generate train &amp; test metadata beforehands.<\/p>\n","protected":false},"author":6,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[14],"tags":[],"class_list":["post-2070","post","type-post","status-publish","format-standard","hentry","category-advanced-topics"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>How to split a dataset (train, test)? - Kairntech Documentation<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to split a dataset (train, test)? - Kairntech Documentation\" \/>\n<meta property=\"og:description\" content=\"When your dataset is ready for Model Experiments, it&#8217;s recommended to generate train &amp; test metadata beforehands.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/\" \/>\n<meta property=\"og:site_name\" content=\"Kairntech Documentation\" \/>\n<meta property=\"article:published_time\" content=\"2023-01-02T16:03:09+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-07-31T13:04:16+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1920\" \/>\n\t<meta property=\"og:image:height\" content=\"715\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"vincent.nibart\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"vincent.nibart\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/\"},\"author\":{\"name\":\"vincent.nibart\",\"@id\":\"https:\/\/kairntech.com\/doc\/#\/schema\/person\/e2b5ed8a33aa3f4a90dca6f0a0c5f0de\"},\"headline\":\"How to split a dataset (train, test)?\",\"datePublished\":\"2023-01-02T16:03:09+00:00\",\"dateModified\":\"2025-07-31T13:04:16+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/\"},\"wordCount\":104,\"image\":{\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-1024x381.png\",\"articleSection\":[\"Advanced Topics\"],\"inLanguage\":\"en-GB\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/\",\"url\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/\",\"name\":\"How to split a dataset (train, test)? - Kairntech Documentation\",\"isPartOf\":{\"@id\":\"https:\/\/kairntech.com\/doc\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-1024x381.png\",\"datePublished\":\"2023-01-02T16:03:09+00:00\",\"dateModified\":\"2025-07-31T13:04:16+00:00\",\"author\":{\"@id\":\"https:\/\/kairntech.com\/doc\/#\/schema\/person\/e2b5ed8a33aa3f4a90dca6f0a0c5f0de\"},\"breadcrumb\":{\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage\",\"url\":\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116.png\",\"contentUrl\":\"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116.png\",\"width\":1920,\"height\":715},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/kairntech.com\/doc\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to split a dataset (train, test)?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/kairntech.com\/doc\/#website\",\"url\":\"https:\/\/kairntech.com\/doc\/\",\"name\":\"Kairntech Documentation\",\"description\":\"All the information you need to use Kairntech Software, methodology,  user and installation guides.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/kairntech.com\/doc\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/kairntech.com\/doc\/#\/schema\/person\/e2b5ed8a33aa3f4a90dca6f0a0c5f0de\",\"name\":\"vincent.nibart\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/kairntech.com\/doc\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/8c6c4f0e2ce82e7f30989e62388adbfe6071cdc185ead6e4bff5281aa3255ae2?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/8c6c4f0e2ce82e7f30989e62388adbfe6071cdc185ead6e4bff5281aa3255ae2?s=96&d=mm&r=g\",\"caption\":\"vincent.nibart\"},\"url\":\"https:\/\/kairntech.com\/doc\/author\/vincent-nibart\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"How to split a dataset (train, test)? - Kairntech Documentation","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/","og_locale":"en_GB","og_type":"article","og_title":"How to split a dataset (train, test)? - Kairntech Documentation","og_description":"When your dataset is ready for Model Experiments, it&#8217;s recommended to generate train &amp; test metadata beforehands.","og_url":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/","og_site_name":"Kairntech Documentation","article_published_time":"2023-01-02T16:03:09+00:00","article_modified_time":"2025-07-31T13:04:16+00:00","og_image":[{"width":1920,"height":715,"url":"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116.png","type":"image\/png"}],"author":"vincent.nibart","twitter_card":"summary_large_image","twitter_misc":{"Written by":"vincent.nibart","Estimated reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#article","isPartOf":{"@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/"},"author":{"name":"vincent.nibart","@id":"https:\/\/kairntech.com\/doc\/#\/schema\/person\/e2b5ed8a33aa3f4a90dca6f0a0c5f0de"},"headline":"How to split a dataset (train, test)?","datePublished":"2023-01-02T16:03:09+00:00","dateModified":"2025-07-31T13:04:16+00:00","mainEntityOfPage":{"@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/"},"wordCount":104,"image":{"@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage"},"thumbnailUrl":"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-1024x381.png","articleSection":["Advanced Topics"],"inLanguage":"en-GB"},{"@type":"WebPage","@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/","url":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/","name":"How to split a dataset (train, test)? - Kairntech Documentation","isPartOf":{"@id":"https:\/\/kairntech.com\/doc\/#website"},"primaryImageOfPage":{"@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage"},"image":{"@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage"},"thumbnailUrl":"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116-1024x381.png","datePublished":"2023-01-02T16:03:09+00:00","dateModified":"2025-07-31T13:04:16+00:00","author":{"@id":"https:\/\/kairntech.com\/doc\/#\/schema\/person\/e2b5ed8a33aa3f4a90dca6f0a0c5f0de"},"breadcrumb":{"@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#primaryimage","url":"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116.png","contentUrl":"https:\/\/kairntech.com\/doc\/wp-content\/uploads\/sites\/2\/2023\/01\/image-116.png","width":1920,"height":715},{"@type":"BreadcrumbList","@id":"https:\/\/kairntech.com\/doc\/how-to-generate-train-and-test-metadata\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/kairntech.com\/doc\/"},{"@type":"ListItem","position":2,"name":"How to split a dataset (train, test)?"}]},{"@type":"WebSite","@id":"https:\/\/kairntech.com\/doc\/#website","url":"https:\/\/kairntech.com\/doc\/","name":"Kairntech Documentation","description":"All the information you need to use Kairntech Software, methodology,  user and installation guides.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/kairntech.com\/doc\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/kairntech.com\/doc\/#\/schema\/person\/e2b5ed8a33aa3f4a90dca6f0a0c5f0de","name":"vincent.nibart","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/kairntech.com\/doc\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/8c6c4f0e2ce82e7f30989e62388adbfe6071cdc185ead6e4bff5281aa3255ae2?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/8c6c4f0e2ce82e7f30989e62388adbfe6071cdc185ead6e4bff5281aa3255ae2?s=96&d=mm&r=g","caption":"vincent.nibart"},"url":"https:\/\/kairntech.com\/doc\/author\/vincent-nibart\/"}]}},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/posts\/2070","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/comments?post=2070"}],"version-history":[{"count":9,"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/posts\/2070\/revisions"}],"predecessor-version":[{"id":5346,"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/posts\/2070\/revisions\/5346"}],"wp:attachment":[{"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/media?parent=2070"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/categories?post=2070"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/kairntech.com\/doc\/wp-json\/wp\/v2\/tags?post=2070"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}