Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These are the fundamental operating assets of commerce, and they represent the last great frontier of untapped data. In the process, Terminal will address many industry-wide pain points, including compliance, manual processes, equipment location, phantom costs, and labor inefficiencies. Ultimately, Terminal will become the central nervous system for the yard, seamlessly connecting all data sources to support an extensive range of essential functions.
Our world-class vision engineering team has built an engine that can process the movement of trucks and containers in real time. It's now time to unlock the potential of that engine by building SaaS applications that leverage the vision engine to transform the logistics industry. As part of Terminal's Site Reliability Engineering team, you will help build out the network and IoT infrastructure required to deploy and operate our camera technology at scale.
We are seeking an experienced Principal Site Reliability Engineer with a minimum of 12 years of relevant experience to join our team. As a founding member of our Engineering team, you will play a pivotal role in architecting and developing cutting-edge solutions. The ideal candidate has expertise in AWS and is proficient in operations and in running software at scale. They will have a deep understanding of event-driven technologies, hands-on experience with modern data stores, a commitment to implementing observability, and a passion for operational excellence. They will take ownership of production quality, reliability, and security.
Oversee the deployment, management, and maintenance of IoT devices, including camera systems and sensors. Ensure devices are properly integrated, configured, and secured within the network.
Manage firmware updates and patches for IoT devices, ensuring that all devices are up-to-date and secure. Develop and implement strategies for efficient deployment of updates.
Implement mechanisms for collecting and processing data from IoT devices. Ensure data integrity, availability, and confidentiality.
Troubleshoot and resolve connectivity issues related to IoT devices. Manage integration between IoT devices and cloud infrastructure, ensuring seamless data flow and system interoperability.
Design and implement solutions to scale IoT deployments effectively. Monitor device performance and system health to ensure high reliability and availability.
Design, build, and operate infrastructure using Infrastructure as Code (IaC) tools like Terraform and Ansible. Develop and maintain infrastructure automation to ensure scalability and reliability.
Define and implement best practices for continuous deployment of software and services using CI/CD tools such as GitHub Actions. Automate deployment processes to streamline operations.
Lead incident response efforts, including diagnosis, resolution, and post-mortem analysis. Implement robust monitoring and alerting systems to ensure quick detection and resolution of issues.
Ensure that systems adhere to security best practices and regulatory compliance requirements. Implement security measures and conduct regular audits to safeguard production environments.
Minimum of 12 years of experience in Site Reliability Engineering or a related role, with a proven track record of managing complex production environments.
Strong background in operating systems, networking, distributed systems, and database management. Expertise in AWS cloud services and infrastructure management.
Hands-on experience with deploying, managing, and maintaining IoT devices and sensor systems. Knowledge of IoT protocols (e.g., MQTT, CoAP) and device integration practices.
Experience in managing firmware updates and ensuring the security and functionality of IoT devices.
Proficiency in managing and troubleshooting connectivity issues in IoT environments, including wireless and wired communication protocols.
Experience with data collection and processing from IoT devices, including ensuring data quality and managing large volumes of data.
Demonstrated experience in incident response, production monitoring, and capacity planning. Ability to handle high-pressure situations and ensure system reliability.
Joining the Terminal team means being part of a dynamic, innovative environment where your work directly impacts the future of logistics and the global supply chain. You will work closely with a team of experts passionate about operational excellence and technological innovation. We offer competitive salaries, a comprehensive benefits package, and opportunities for professional growth.