Job Overview

Location
Remote, Any Country
Job Type
Full Time
Date Posted
13 hours ago

Additional Details

Job ID
2194
Work Mode
Remote

Job Description

TruthScan delivers cutting-edge AI detection solutions that provide protection across text, images, voice, and video, safeguarding organizations from advanced AI-generated threats. With tools like the Email Scam Detector, AI Voice Detector, and deepfake identification systems, TruthScan defends against AI-powered phishing, deepfake media scams, and voice cloning fraud. Our solutions ensure enterprise-grade accuracy, helping organizations prevent security breaches, brand damage, and financial loss. We are dedicated to maintaining trust and security in the rapidly evolving AI-driven landscape.


Position Summary

We are seeking a skilled Data Engineer to support the backend operations of our machine learning division. The ideal candidate will manage the flow of complex unstructured data, ensuring that our research and engineering teams have reliable, scalable access to training resources. This position requires a strong grasp of systems engineering, storage optimization, and data logistics.


Key Responsibilities

  • Data Logistics: Manage end-to-end data flow, from ingestion and storage to provisioning for development environments.
  • Storage Management: Maintain and optimize networked storage volumes and cloud repositories to ensure low-latency access for research workflows.
  • Dataset Preparation: Execute precise data partitioning (training/validation/test) and formatting based on technical specifications.
  • Standardization: Enforce strict naming conventions, directory structures, and quality standards across the organization’s data assets.
  • Operational Support: Troubleshoot data accessibility issues and optimize transfer speeds between storage and compute resources.
  • Process Improvement: Identify bottlenecks in the data supply chain and implement solutions to increase throughput and reliability.


Qualifications

  • Experience managing unstructured data at scale (files, objects, or blobs) within cloud or on-premises infrastructure.
  • Technical proficiency with command-line tools, scripting for file management, and version control systems.
  • Understanding of the machine learning development lifecycle and the infrastructure required to support model training.
  • Strong analytical skills with a focus on process efficiency and data integrity.
  • Ability to work autonomously and coordinate effectively with technical stakeholders.


Benefits

  • A competitive salary and the freedom to work remotely from anywhere
  • The chance to collaborate with a globally distributed team of ambitious, high-performing individuals
  • 10 days of PTO annually—because rest fuels excellence
  • An all-expenses-paid regional offsite each year to connect, align, and celebrate as a team
  • A culture built on ownership, velocity, and personal growth—where your ideas move fast and your impact is real
