Citi Distributed Data Technologies Architect VP (Tampa FL or Irving TX) in Irving, Texas
Primary Location: United States,Texas,Irving
Other Location: United States,Florida,Tampa
Education: Bachelor's Degree
Job Function: Technology
Shift: Day Job
Employee Status: Regular
Travel Time: Yes, 10 % of the Time
Job ID: 16048300
Distributed Data Processing Platform Architect leads Distributed Data Servers and Parallel and Distributed Computing Projects:
Evaluates, Estimates, and Creates High Performance Distributed Computing and Distributed Data Management Architectures.
Leads Platform Development Projects: Design and Development of Data Sources Abstraction Connectors, Data Extraction and Ingestion Algorithms, Heterogeneous Data Processing and Integration Solutions.
Estimates Project Resources and Evaluates Cost/Time/Scope tradeoffs.
Creates Agile Project Plans
Leads Projects through SDLC
Delivers Projects and provides appropriate support.
Creates and delivers presentations for technical and business audiences.
Required Knowledge and Skills:
7+ years’ experience in the following Areas:
Software Engineering and Programming Data Structures: Arrays, Lists, Queues, Heaps, Stacks, Trees, Graphs.
Algorithms: Sorting, Searching, Hashing, Parsing, Greedy
Algorithms, Graph Algorithms.
10 years of expert-Level Java server-side software design and development.
Solid core Java and multithreading experience.
Strong knowledge of socket programming and proficient in TCP/IP and UDP-based network programming (sockets, datagrams and multicasting) in Java.
Solid understanding of the major design patterns.
Experience working with any Parser Generators like ANTLR
Experience writing any code generators based on your custom grammar.
Big Data experience is a big plus(Hive, Pig, Spark, etc.).
Any experience extending or contributing to open source frameworks is a big plus.
Data Models: Conceptual, Logical, Physical Relational Data Models. Components, Relation Types, Normal Forms and Normalization Procedures. Relational Data Models Design. Data Modeling Patterns: Super type-subtype, Star Schemas, Snowflakes, Dimensional Modeling. Denormalization procedures.
SQL Parsing, Optimization and Execution
Access paths, Filtering and Lookup Algorithms, Join Algorithms,
Data processing performance vehicles: Indexes and Indexing St
Materialized Views, Clusters, Caching, Parallel Processing
Transactions: Definition, Properties, Isolation Levels. Locking Strategies
Enterprise Level Database servers: Teradata, Oracle, DB2, Sybase Security and Data Access Management in Enterprise DB Servers.
Data Processing Architectures, Memory Architectures, Storage Architectures: shared memory, shared storage, shared nothing SMP and MMP
Extensible hashing, B+Trees, R+Trees.
Buffer Pool Management Strategies.
Data recovery strategies. Redo logs and Undo logs.
Parallel processing patterns: Pipelined and Partitoned DW Appliances: Netezza, Vertica, Greenplum, Exadata Column Oriented Databases
Map Reduce Architectures.
Declarative Data Processing layers. Declarative Data Processing Languages and their translation to MapReduce or SQL.
Data Replication Architectures
Data Access Management Architectures
Data Encryption, Decryption, Masking Architectures. Data Quality Concepts Master Data Repositories Metadata Repositories.
Education BS or higher in Computer Science or Engineering or equivalent work experience
This position is local to Irving TX. Relocation is not budgeted for this position. Candidates from outside of the area must be able to relocate themselves. Relocation assistance will not be provided.