Data Engineer
What sets INVID apart is our collaborative and flexible work environment. We encourage our team to raise the bar in everything they do while maintaining a healthy work-life balance. With our hybrid work model, team members thrive both in the office and remotely. We foster a culture of mutual respect, autonomy, and accountability, where your voice matters and your growth is supported. From structured career paths and paid professional development to access to industry events, we’re committed to your success.
Join us at INVID, where innovation meets support, and together we deliver excellence.
Job Description
We are hiring a Data Engineer to build the data infrastructure powering our predictive analytics initiative. You will create the pipelines that turn raw vessel tracking data into training datasets for ML models. The core challenge: we have rich behavioral data (vessel positions, AIS gaps, ship-to-ship transfers, spoofing events) but limited labeled outcomes (confirmed violations, detentions, seizures). You will build pipelines that create usable training data through proxy labels, data joins, and outcome correlation.
Responsibilities
• Build labeling pipelines that join behavioral events to outcome data (sanctions designations, flag changes, detentions)
• Implement proxy labeling strategies that create training signal from observable outcomes
• Build weak supervision infrastructure to combine multiple noisy labeling rules
• Create and maintain ML training datasets at scale
• Build data validation and quality monitoring systems
• Implement versioning for reproducible model training
• Integrate LRIT position data for prediction validation
• Build pipelines that compare predicted locations against actual LRIT reports
• Create feedback loops that improve model accuracy over time
• Scale data infrastructure as models and data sources grow
Required Skills
• 4+ years of data engineering experience
• Strong SQL skills, including complex joins across large datasets
• Experience with Spark, Airflow, or equivalent distributed processing frameworks
• Python for data processing and pipeline orchestration
• AWS experience
• Understanding of ML training data requirements
Education/Certifications
• Bachelor's Degree in Computer Science, Engineering, or related field
Desired Skills (Not Required)
• Experience with geospatial data (PostGIS, H3, spatial joins)
• Maritime, defense, or intelligence domain experience
• Experience with data labeling infrastructure or weak supervision
• Familiarity with real-time streaming data systems
Compliance Requirements
This position requires use of or access to information subject to the Export Administration Regulations ("EAR") or the International Traffic in Arms Regulations ("ITAR"). Accordingly, all applicants must be U.S. persons within the meaning of these regulations. Under ITAR, a U.S. person is defined as a U.S. Citizen, U.S. Permanent Resident, or a person who is a protected individual under the Immigration and Nationality Act (8 U.S.C. 1324b(a)(3)).
Important:
Must be a U.S. citizen and a U.S. resident
This position follows a hybrid work model (San Juan, Puerto Rico)
Must have a valid driver's license
EEO