Job Description
Note: This is a remote position open to candidates in the USA.

Mastech Digital is a provider of digital and mainstream technology staff and services for American corporations. They are currently seeking a Data Scientist-Python Libraries to develop and maintain Python modules for text parsing and to implement NLP techniques that process unstructured data.

Responsibilities
• Develop and maintain Python modules for text parsing, cleaning, and extraction.
• Implement NLP and text analytics techniques to process unstructured data into structured outputs.
• Integrate external APIs, open-source libraries, and cloud services into data workflows.
• Write robust code with error handling and exception management for data pipelines.
• Build utilities for rule-based text extraction, normalization, and transformation.
• Document workflows, experiments, and code in a structured manner.

Skills
• 2-5 years of experience in Python-based development.
• Strong knowledge of NLP libraries and text analytics (spaCy, NLTK, regex, transformers).
• Familiarity with data parsing, unstructured data processing, and extraction frameworks.
• Experience with external APIs and JSON/structured data handling.
• Solid understanding of error handling and debugging practices in Python.
• Strong analytical skills, with the ability to work on unstructured datasets.
• Minimum 7+ years of experience.
• Local Preferred: Yes

Education Requirements
• Bachelor's degree in Computer Science, Data Science, Engineering, or a related field.

Benefits
• Medical, Dental (including Ortho) & Vision Insurance (option to enroll)
• Paid leave (wherever applicable)
• Life & Disability Coverage (upon eligibility)
• 401K option, Education Assistance Program, and more