Abstract: Customizing Large Language Models (LLMs) for specific tasks demands high-quality, domain-specific datasets. Existing solutions often struggle with extracting meaningful and structured data ...