Professional focus of this freelancer

Java software development and architecture: Big data, Hadoop, NoSQL, search engines, data mining, data science, machine learning, Linux

Available from
Oct 15, 2019
Availability
100 %
on customer site
100 %
Areas

D1

Contact Settings

I prefer getting offers for these sites. However, you may also contact me for interesting projects in other places.

Projects

04/2015 - 07/2015

4 months

Rollout of a Hortonworks big data cluster for data scientists

Roles
Architect, Software-Engineer, DevOps-Engineer
Customer
ING-DiBa AG
Place of work
Frankfurt am Main
Tasks

Test deployment of a big data infrastructure (Hortonworks, Ambari, Docker, Vagrant, Bash). Final planning and provisioning of the cluster: rights management, security and access, directory services connectivity (AD/LDAP/Kerberos), terminal server, secure integration of software components and repositories. Training and consulting for data scientists on scaling single/shared-memory algorithms to big data (Spark, Hive, Hadoop) and on using continuous integration (Jenkins, Git).

Skills

Continuous integration

Terminal server

Scrum

Virtualization

MapReduce

Big data

Products

Docker

Vagrant

Hortonworks

Ambari

Spark

Git

Hadoop

Hive

LDAP

Kerberos

01/2015 - 03/2015

3 months

PaaS: Customer-tailored web crawler and data miner

Roles
Developer and architect
Customer
Company in the course of formation
Place of work
Freiburg
Tasks

Development and architecture of a PaaS: tailored web crawler and data miner. Vertical search engine. Efficient crawling and indexing of URLs. Graph-, NLP-, ML- and regex-based data mining. REST-based web services.

Skills

Storm

Titan

ElasticSearch

Ansible

NLP

Mahout

Cassandra

Kafka

Spring

Stanford CoreNLP

Hadoop

Spark

Machine learning

Products

Eclipse

Git

Maven

02/2014 - 12/2014

11 months

Development of a highly scalable semantic middleware.

Roles
Developer
Place of work
Wiesbaden
Tasks

Development and architecture of a big data platform to dynamically link and analyse unstructured data. Development of the incremental analytics (NLP, Hazelcast MR, ElasticSearch, Cassandra); analysis, transformation and persistence of the derived data in a distributed graph database (Titan, Gremlin, Tinkerpop/Blueprints); development of high-throughput batch jobs (Hadoop MR, Faunus, TitanHadoop).

Skills

Java

Java EE

Shell-Script

SQL

Gremlin

XML

NoSQL

SOAP

REST

Spring (Batch, Boot, Core)

Lucene

ElasticSearch

Natural Language Processing

SCRUM

Continuous Integration

HDFS

MapReduce

DAO/ORM

Products

Eclipse

Ant

Maven

Subversion

Git

Gitorious

JUnit

EasyMock

ActiveMQ

Hudson

Titan

Faunus

Cassandra

Hadoop

MySQL

Hibernate

OpenNLP

01/2001 - 01/2014

13 years 1 month

Older projects

Roles
Developer, Architect
Place of work
Misc.
Tasks

--------------------------------------------------------------------------------

Period: 03/13-01/14

Industry: IT services

Project: Architecture and development of a specialized, distributed search engine application for SEO services.

[Search engine] Crawling, parsing and indexing with Hadoop MapReduce jobs; storage in HDFS and HBase (content) and Cassandra (links, TLD zone files (domains)); index with ElasticSearch (various doctypes, mappings and indexes). [Frontend] Development of the frontend with GWT. [Backend] Services as GWT RPC services and servlets (REST), database connectivity for client/user data, security, login, text processing with CoreNLP and Weka, mailing. [Client app] Development of a desktop application with JavaFX, browser control via Selenium, backend communication via JSON/servlets. Control of multiple browser instances, local data management with SQLite, fast crawler for client URLs.

Position: Architecture, development and review.

Knowledge & context: [Languages] Java, XML/XSD, shell script, Python, JavaScript, SQL, FXML, UiBinder XML, CSS, HTML [Technologies] J2EE/JEE, MapReduce, SQL & NoSQL, JavaFX, GWT, JSP, web services, DAO/ORM (Hibernate), machine learning (Weka), Lucene, JavaMail, MVC, data mining & classification (Mahout), natural language processing (CoreNLP) [Products/tools] Tomcat, MySQL, HBase, Cassandra (Thrift, CQL3), Hadoop (MR, HDFS, Pig, Hive), Netbeans, Eclipse, Ant, Maven, Subversion, Selenium, Nutch, JUnit, ElasticSearch [overlapping projects, see below]

--------------------------------------------------------------------------------

Period: 06/09-06/12

Industry: Research

Project: Development of a parallel, particle-based multiphysics simulator for the numerical characterization and analysis of oscillating droplets.

Development of an efficient, particle-based multiphysics simulator in the DFG project "Electrowetting-Simulation mit Partikelmethoden" (electrowetting simulation with particle methods): numerical simulation of small droplets that are moved by electric fields. Focus areas: discrete simulation, physical modelling, numerical characterization, scientific computing. [Implementation] Modular class hierarchies for forces, integrators, pair-list computation and interactors. Adaptivity and various solvers for systems of linear equations. Optimized and parallelized code.

Knowledge & context: [Methods W] Smoothed Particle Hydrodynamics, Dissipative Particle Dynamics, Smoothed Dissipative Particle Dynamics, Moving Least Squares, systems of linear equations [Modelling] hydrodynamics, electrostatics, partial differential equations, statistics [Languages] Java, C++ [Products/tools] JUnit, Ant, SVN, Netbeans, ProGuard (obfuscation), Trac [Platform] Linux [Methods] agile software development (SCRUM)

Position: Modelling, architecture, development, project management

--------------------------------------------------------------------------------

Period: 01/09-12/12

Industry: Research

Project: Administration of a Linux cluster.

Procurement, setup and administration of a Linux HPC cluster (shared and distributed memory, grid).

Knowledge & context: [Languages] Shell script, Python [Services] Grid (Sun Grid Engine), Apache Httpd, CMS (Zope/Plone, WordPress), FTP, DNS (Bind), LDAP, Samba, NFS, Wiki (DokuWiki, MoinMoin), virtualization (VirtualBox, qemu, VMWare) [Features] GPGPU nodes, failover file serving, UPS

Position: Architecture, procurement, installation, administration

--------------------------------------------------------------------------------

Period: 03/09-09/09

Industry: Research

Project: Development of modules for a particle-based simulator framework.

Coupling to finite element solvers, efficient solution of sparse systems of linear equations, particle-field coupling for the Navier-Stokes equations and electrostatics, coupling of the mesoscopic and macroscopic scales.

Knowledge & context: [Methods] Smoothed Particle Hydrodynamics, Finite Element Method, systems of linear equations [Modelling] hydrodynamics, electrostatics, partial differential equations [Languages] C++, Java [Products/tools] JUnit, SVN, Netbeans, Eclipse, Emacs [Platform] Linux

Position: Development

--------------------------------------------------------------------------------

Period: 10/05-06/08

Industry: Consumer products/semiconductors

Project: Development of a proprietary design flow.

Development and extension of a proprietary semi-custom design flow (Java, Perl, C). Reporting, versioning (CVS, SVN) and issue tracking (Bugzilla); design of the flow interfaces, multi-site data management and releasing, version control of the flow data, EDA tool integration, hierarchical XML-based configuration, creation of documentation (manuals, man pages, Javadoc) and tests (JUnit, Perl).

Knowledge & context: [Languages] Java, Perl, C, Python, PHP, XML/XSD, SQL [Products/tools] JUnit, Bugzilla, MySQL, Netbeans, Eclipse, Emacs, CVS [Platform] Linux [Technology] Swing

Position: Architecture, development

--------------------------------------------------------------------------------

Period: 11/07-03/08

Industry: Consumer products/semiconductors

Project: Development of a SystemC simulator.

Development of a high-performance SystemC simulator for the m8051ew microcontroller with complete coverage of the microcode/instruction set.

Knowledge & context: [Languages] C++, Assembler [Technology] SystemC [Products/tools] Emacs, CVS [Platform] Linux

Position: Architecture, development, review

--------------------------------------------------------------------------------

Period: 12/06-06/07

Industry: Consumer products/semiconductors

Project: Migration/development of an online database (PHP -> Java Server Faces, TXT -> MySQL/Oracle) for project management.

Extension of a web-based design-flow project database for the overarching documentation of subprojects, releases/IPs and automated validation (Perl/Python scripts).

Knowledge & context: [Languages] Java, Perl, Python, PHP, XML/XSD, SQL [Technologies] SQL, JavaEE/JEE, MVC, JSF [Platform] Linux [Products/tools] Tomcat, Bugzilla, MySQL, Oracle, Eclipse, Emacs, CVS

Position: Redesign, development

--------------------------------------------------------------------------------

Period: 03/05-07/05

Industry: Environment/health

Project: Development of a geographic information system application.

Development of a GUI, implementation of various classes for the computation and display of geodata.

Knowledge & context: [Languages] C# [Products/tools] esri ArcGIS [Platform] Windows

Position: Development

--------------------------------------------------------------------------------

Period: 01/04-09/05

Industry: Research

Project: Development of simulator components.

Implementation of models using the finite element and finite volume methods. Implementation of a mesoscale particle-field coupling.

Knowledge & context: [Languages] C++ [Methods] FEM, FVM [Products/tools] Diffpack, OpenFOAM [Platform] Linux

Position: Architecture, development

--------------------------------------------------------------------------------

Period: 04/03-06/03

Industry: Research

Project: Development and migration of an application for simulation data.

Import, processing and export of geometry and measurement points from ANSYS simulation data (pre/post-processing).

Knowledge & context: [Languages] C++, Fortran [Products/tools] Ansys [Platform] Linux

Position: Development

--------------------------------------------------------------------------------

Period: 07/02-09/02

Industry: Industry services

Project: Components for process/supervisory control software.

Contribution to the development of supervisory/process control software.

Knowledge & context: [Languages] C, VB [Products/tools] Visual Studio [Platform] Windows

Position: IAESTE intern, development

--------------------------------------------------------------------------------

Period: 08/01-10/01

Industry: Media

Project: MS Windows unattended setup / rollout for self-service terminals.

Windows unattended setup for autonomous, secured self-service systems/terminals for various target applications. Development of DLLs (MFC) and integration via proprietary macros.

Knowledge & context: [Languages] C/C++, Script [Technologies] MFC [Products/tools] macro recorder [Platform] Windows

Position: Development

Industries

E-Commerce

IT-Services
Research
Industry-Services
Media
Environment/Health
Consumer products/Semiconductors

Skills

Language Skills
German
first language
English
fluent
French
lower intermediate
Portuguese
basic knowledge
Serbo-Croatian
basic knowledge
Spanish
fluent

Programming Languages
Ansible
Recent project experience
Assembler
C
C#
C++
GNU-Make
Gremlin
Recent project experience
Java
Recent project experience
Pascal
Perl
PHP
PL/SQL
Recent project experience
Shell
Recent project experience
TeX, LaTeX
Recent project experience
yacc/lex

For more details, see below: Products / Standards / Experiences


Operating Systems
Linux
outstanding knowledge, exclusive use of Linux since 2003
Windows

For more details, see below: Products / Standards / Experiences


Products / Standards / Experiences
A Software engineering
Main field of interest
B Research and development
Secondary field of interest
C Computer administration
Profound linux knowledge

Software engineering – Focus on Java, Big Data, Databases, Search


Programming languages:

Java SE/EE, C/C++, Gremlin, Perl, PHP, Python, XML/XSD,

Shell script, C# , Assembler, VB

Operating systems:

Linux (Ubuntu, Debian, Gentoo), Solaris, Windows

Tags/Misc: see below "Comments"

Tools/Products:

Subversion, Git, CVS, Ant/Ivy, Maven, Jenkins, Hudson,

     JUnit, EasyMock, jMock, Gitorious, Ansible

Databases:

     Cassandra, Titan, HBase, MySQL, Neo4j

     Tags/Misc: Graph databases, JDBC, NoSQL, SQL

Web C&S              

     Google Web Toolkit, Selenium, HtmlUnit, Java Server Faces, JavaFX

     Tags/Misc: TCP/IP, Internet, ISO/OSI

Search               

     ElasticSearch, Lucene, Nutch

Frameworks           

     Spring (Core, Boot, Batch)

Beef                 

     Hadoop (MapReduce, HDFS, Pig, Hive), Storm, NLP (CoreNLP, OpenNLP),

     Machine Learning (Weka, Mahout), Kafka, OpenMP

     Tags/Misc: Big Data, Stream processing, Message Queuing

Methods

     Object-oriented programming (OOP, OOD), continuous integration,
     MVC, UML, DAO/ORM, design patterns, agile software development (SCRUM)

 

 


Computer Administration - Focus on Ubuntu-based Linux servers


Hardware             

     Cluster-Computing, GPGPU-Computing, Storage, Virtualisation (VirtualBox, VMWare, KVM)
Services

     Apache Httpd, Tomcat, MySQL, Hadoop, HBase, Zookeeper, elasticsearch, Storm,

     Cassandra, CMS (Zope/Plone, WordPress), FTP, DNS (Bind), LDAP,

     Samba, NFS, Wiki (DokuWiki, MoinMoin)
Cloud                

     Amazon EC2, Amazon Mechanical Turk, Google Apps

 

 


R&D – Focus on algorithms and modelling and simulation of fluid problems with particle methods


Analytics and Physics:

     Modelling, multiphysics/multiscale-simulation, fluid dynamics, statistics, electrostatics,

     heat conduction, stress/strain

Algorithms and numerics:

     Particle methods (smoothed particle hydrodynamics, (smoothed) dissipative

     particle dynamics, moving least squares), PDEs, continuum methods

     (finite elements, finite differences, finite volume), spherical harmonic functions,

     efficient solution of systems of linear equations, simulated annealing

Tools:

     Mathematica, openFOAM, Diffpack, ANSYS, COMSOL Multiphysics

 


Databases
Cassandra
Recent project experience
HBase
Recent project experience
MySQL
Recent project experience
Neo4J
Oracle
PostgreSQL
Titan
Recent project experience

For more details, see below: Products / Standards / Experiences


Comments

* Projects in the field of R&D or engineering (e.g. signal processing, integral transformations, theoretical focus with algorithms, modelling, analysis, statistics, numerics, computational physics) are very welcome if the project has some overlap with my main focus of software development and big data.

 

* I have many years of experience as a Linux user (exclusively Linux at home since 2003) and as an administrator (since 2009).

 

* I have specialized knowledge of search engine optimization, e-commerce and the development of corresponding client tools.

 

* I can provide my list of scientific publications on request.


Training History

1998-2005

     Study of Microsystems Engineering, Faculty of Engineering, University of Freiburg

2000-2001

     Erasmus semester abroad (University of Basque Country, Faculty of Informatics,

     San Sebastian, Spain)

2002     

     IAESTE internship (IHM Engenharia e Sistemas de Automação Ltda., Belo

     Horizonte, Brazil)

2005

     Diploma: Dipl.-Ing. Mikrosystemtechnik (Final grade: 1,6)

     Diploma thesis: "Coupling of continuum mechanics with dissipative

     particle dynamics"

2009-2012

     PhD student, Faculty of Engineering, University of Freiburg

     Computational Physics, Development of Simulators for Fluid-Mechanics
