<span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">As we continue to grow, we’re looking for a skilled GPU Systems Engineer (CUDA) to join our dynamic team and contribute to our mission of transforming business processes through technology.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.</span></span><h1 style="margin-top:16px;margin-bottom:11px;"><span style="font-size:15pt;"><span style="font-family:Arial, sans-serif;">GPU Systems Engineer (CUDA)</span></span></h1><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Job Title:</b> GPU Systems Engineer (CUDA)</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Location:</b> 100% Remote (Continental United States)</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Position Type:</b> In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Experience:</b> 6+ years<br><strong>Salary Range:</strong> $100k to $150k per annum</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Sponsorship:</b> No new H1B sponsorship available. 
H1B transfers welcomed for qualified candidates.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Employment Type:</b> Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Engagement:</b> Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>Compensation:</b> Competitive base salary commensurate with experience, plus benefits.</span></span><br><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b><span lang="en-us" style="font-size:13pt;">Employment Terms &amp; Visa Policy</span></b></span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.</b></span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement.</b> The client, end customer, and employer for this position is Bright Vision Technologies; there is no third-party client, vendor, or implementation partner involved.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">We do not engage in C2C, 1099, or third-party arrangements for this role.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>STRICTLY NO C2C, 1099, OR THIRD-PARTY COMPANIES. 
ALL OUR ROLES ARE W2, AND WE DO NOT ACCEPT THIRD-PARTY BROKERING.</b></span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">No new H1B sponsorship is available for this role.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b>However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.</b></span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.</span></span><br><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b><span lang="en-us" style="font-size:13pt;">Job Summary</span></b></span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">We are seeking a GPU Systems Engineer with deep expertise in CUDA programming, GPU architecture, and high-performance computing to design and optimize compute-intensive workloads on modern accelerator hardware. This role focuses on extracting maximum performance from GPU platforms for AI training, inference, scientific computing, and high-throughput data processing workloads. The ideal candidate combines low-level systems mastery with strong software engineering practices, and has a track record of delivering measurable performance improvements on production GPU systems. 
In this role you will work closely with cross-functional partners — product, design, engineering, operations, and business stakeholders — to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b><span lang="en-us" style="font-size:13pt;">Key Responsibilities</span></b></span></span><ul style="margin-bottom:4px;"><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Design and implement high-performance CUDA kernels for compute-intensive workloads across AI and HPC use cases.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Profile and optimize GPU code using tools such as Nsight Systems, Nsight Compute, and CUDA profilers.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Tune memory access patterns, occupancy, register usage, and shared memory utilization for peak performance.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Develop highly optimized libraries for linear algebra, attention, and other ML primitives.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Optimize multi-GPU and multi-node training using NCCL, RDMA, and high-performance networking.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span 
style="font-family:Arial, sans-serif;">Implement custom operators and fused kernels in PyTorch, JAX, or Triton.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Collaborate with ML engineers to identify performance bottlenecks in training and inference pipelines.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Develop benchmarks and regression tests to safeguard performance over time.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Evaluate new GPU architectures and feature sets, and advise on adoption strategy.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Contribute to compiler-level optimizations for tensor programs where appropriate, working at the boundary between ML frameworks and underlying accelerator codegen to unlock performance not reachable through framework-level tuning alone.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Optimize memory hierarchy usage across HBM, L2, shared memory, and registers.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Implement mixed-precision and quantized compute paths that maximize accelerator throughput while preserving numerical fidelity within bounds acceptable for the target workloads.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Document performance characteristics, design decisions, and tuning playbooks for internal teams.</span></span></li><li 
style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Stay current with GPU architecture, CUDA evolution, and emerging accelerator technologies.</span></span></li></ul><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b><span lang="en-us" style="font-size:13pt;">Required Qualifications</span></b></span></span><ul style="margin-bottom:4px;"><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Six or more years of experience in GPU programming and performance engineering.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Deep expertise in CUDA C/C++ and GPU programming models.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Strong understanding of modern GPU architectures, memory hierarchies, and execution models.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Hands-on experience profiling and optimizing GPU workloads in production.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Familiarity with NCCL, MPI, and high-performance interconnect technologies.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Experience integrating custom kernels into ML frameworks.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span 
style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Strong C++ skills and familiarity with modern systems programming practices.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Solid grounding in linear algebra and numerical methods.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Strong communication and collaboration skills with research and engineering teams.</span></span></li></ul><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b><span lang="en-us" style="font-size:13pt;">Preferred Qualifications</span></b></span></span><ul style="margin-bottom:4px;"><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Experience with Triton, CUTLASS, or other GPU kernel authoring frameworks.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Familiarity with TensorRT, FasterTransformer, or vLLM internals.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Exposure to compiler infrastructure such as LLVM or MLIR.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Open-source contributions to GPU or ML performance libraries.</span></span></li><li style="margin-bottom:4px;margin-left:8px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Experience with large-scale distributed training infrastructure.</span></span></li></ul><h1 style="margin-top:16px;margin-bottom:11px;"><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;"><b><span lang="en-us" style="font-size:13pt;">How to 
Apply</span></b></span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Would you like to know more about this opportunity?</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">For immediate consideration, please send your resume to jaya@bvteck.com or contact us at (908) 505-3545. Learn more about Bright Vision Technologies at www.bvteck.com.</span></span><br><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. </span></span><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.</span></span><br><span style="font-size:11pt;"><span style="font-family:Arial, sans-serif;">Position offered by “No Fee Agency.”</span></span></h1><p>Equal Employment Opportunity (EEO) Statement</p>
<p>Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.</p>
<p>BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.</p>