Overview: We are looking for an experienced AI Data Analyst with 7+ years of professional experience, including leadership in tech projects. The ideal candidate should have strong expertise in Python, Machine Learning, AI APIs, and Large Language Models (LLMs). Youll work on cutting-edge AI solutions, including vector-based search and data-driven business insights.
Experience: 7+ Years
Job Location: Mahape, Navi Mumbai (Hybrid Work from Office)
Joining: Immediate Joiners Preferred
Experience must include:
- 2+ year of hands-on experience as a Data Analyst.
- 1+ year of practical experience with AI systems (LLMs, AI APIs, or vector-based search).
- 2+ years of experience working with Machine Learning models/solutions.
- 5+ years of hands-on Python programming.
- Exposure to vector databases (e.g., pgvector, ChromaDB) is a plus.
Key Responsibilities:
- Perform data exploration, profiling, and cleaning across large datasets.
- Design, implement, and evaluate machine learning and AI models for business problems.
- Leverage LLM APIs, foundation models, and vector databases to power AI-driven analysis.
- Build end-to-end ML workflows from data preprocessing to deployment.
- Develop visualizations and dashboards for internal reports and presentations.
- Analyze and interpret model outputs, providing actionable insights to stakeholders.
- Collaborate with engineering and product teams to scale AI solutions across business processes.
Required Skills:
Data Analysis:
- 1+ year of hands-on experience working with real-world datasets
- Strong Exploratory Data Analysis (EDA), data wrangling, and visualization using tools like Pandas, Seaborn, or Plotly
Machine Learning & AI:
- 2+ years of experience applying machine learning techniques (classification, regression, clustering, etc.).
- 1+ year of hands-on experience with AI technologies, including Generative AI, LLMs, AI APIs (e.g., OpenAI, Hugging Face), and vector-based search systems.
- Familiarity with model evaluation, hyperparameter tuning, and model selection.
- Exposure to AI-driven analysis, including RAG (Retrieval-Augmented Generation) and other AI solution architectures.
Programming:
- 3+ years of Python programming experience, with proficiency in libraries like scikit-learn, NumPy, Pandas, etc.
- Strong understanding of data structures and algorithms relevant to AI and ML.
Tools & Technologies:
- SQL/PostgreSQL proficiency.
- Experience with vector databases (e.g., pgvector, ChromaDB).
- Exposure to LLMs, foundation models, RAG systems, and embedding techniques.
- Familiarity with AWS, SageMaker, or similar cloud platforms.
- Version control systems knowledge (e.g., Git), REST APIs, and Linux.
Good to Have:
- Experience with Scrapy, SpaCy, or OpenCV
- Knowledge of MLOps, model deployment, and CI/CD pipelines.
- Familiarity with deep learning frameworks like PyTorch or TensorFlow.
Soft Skills:
- Strong problem-solving mindset and analytical thinking.
- Excellent communication skills, capable of presenting technical information clearly to non-technical stakeholders.
- Collaborative, proactive, and self-driven in a fast-paced, dynamic environment.
Share your resume with kajal.uklekar@arrkgroup.com.