Unstructured Data Science Intern
Bharat.Law
Job Description
Company Description
Bharat.law is an innovative legal intelligence platform tailored to the complexities of Indian law. Powered by its proprietary NyaI™ legal intelligence technology, Bharat.law enables precision in legal research, case analysis, and collaborative workflows for legal professionals and litigation-heavy organizations. The platform supports verifiable legal research, efficient case management, and structured legal intake.
Bharat.law’s mission is to reduce friction in Indian litigation by providing clarity and precision, ensuring better outcomes for lawyers, clients, and teams. Built in India, the platform is designed to meet the specific needs of Indian law. We are looking for interns who want to work on real data problems at meaningful scale, not toy datasets or isolated experiments.
What you will work on
Contribute to the creation of one of India’s largest legal data and intelligence platforms
Build NLP-driven data processing pipelines for terabyte-scale unstructured datasets
Help design platform approaches for processing and unifying disparate knowledge sources
Work on extraction, transformation, structuring, and enrichment of complex legal and regulatory data
Support the development of scalable systems for search, retrieval, and intelligence over large document collections

Who should apply
3rd year engineering students in Computer Science, Electrical Engineering, or closely related fields
Strong programming fundamentals, especially in Python
Interest in NLP, machine learning, information retrieval, or large-scale data systems
Comfortable working with messy, unstructured, real-world data
Curious, self-driven, and excited by hard technical problems with real-world impact

Why this role is different
Most internships give exposure to narrow tasks. This role offers the chance to work on foundational infrastructure for a population-scale legal intelligence platform. You will get hands-on experience with large-scale unstructured data, applied NLP pipelines, and platform thinking across multiple knowledge sources.
What you will get
A chance to work on meaningful, high-impact technical problems
Exposure to real-world NLP and large-scale data engineering challenges
Access to Claude Code or Codex to vibe code
Mentorship in building systems that operate at production scale
Flexibility of remote work, with monthly in-person collaboration in Delhi NCR
The opportunity to be part of a company shaping the future of legal research and intelligence in India

Location requirement
Applicants must be based in the Delhi NCR region.

To apply
Send us your resume along with a short note on why unstructured data, NLP, or large-scale knowledge systems interest you.