How to use the GFG Platform
Help navigating and using the platform
Post Date: 2020-09-18
Topic : Graphs for Genealogy
The GFG platform is designed to help genealogists analyze their data. Each user creates a separate database and then uploads their data. This post explains the process.
The recommended workflow is:
- Registration: You must have login privileges to use the platform. When you first click the Login menu, you will see a link to register. Your registration must be approved before you can proceed. You can then log in and access the other platform resources.
- Create a Database: Neo4j v4 supports multiple databases on the same platform. Create your database by navigating to Database Management > Setup new Neo4j database. Approach this step carefully: your database name must be unique, and your username and password will control your access. Once submitted, the database is created and access is restricted to you. Your initial rights will be to upload data and run queries. If you wish to include other users or extend your roles, please contact the platform administrator. We plan to put these changes under your control at a later time.
- Data Uploading: Upload files by navigating to Data Management > Upload data to your own Neo4j database. While there is some flexibility, the following order of uploads is suggested. The sequence matters because each added file not only creates nodes and edges within its own dataset but also creates new edges to nodes created by prior uploads.
- GEDCOM: Export a GEDCOM from your genealogy software. GFG will use the unique identifiers in this GEDCOM to identify the nodes created. Re-loading may cause problems if these identifiers are modified in a separate file. Our beta-testing will explore this risk. This file will create the following nodes and edges:
Nodes     Edges      Edge Links
Person    child      Union→Person
Union     father     Person→Person
Place     mother     Person→Person
          husband    Person↔Person
          wife       Person↔Person
          ufather    Person→Union
          umother    Person→Union
          bp         Person→Place
          dp         Person→Place
          up         Person→Place
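As a rough illustration of how the GEDCOM edges listed above might be queried, the sketch below transcribes the edge list into a Python mapping and builds a Cypher MATCH clause for a chosen edge. It assumes, without confirmation from the platform, that the node names are used as Neo4j labels and the edge names as relationship types. The helper name match_pattern and the LIMIT value are hypothetical.

```python
# Edge name -> (source label, target label, directed?), transcribed from the
# GEDCOM edge list above. Treating these names as Neo4j labels and
# relationship types is an assumption, not confirmed platform behaviour.
GEDCOM_EDGES = {
    "child":   ("Union", "Person", True),
    "father":  ("Person", "Person", True),
    "mother":  ("Person", "Person", True),
    "husband": ("Person", "Person", False),  # listed as Person↔Person
    "wife":    ("Person", "Person", False),  # listed as Person↔Person
    "ufather": ("Person", "Union", True),
    "umother": ("Person", "Union", True),
    "bp":      ("Person", "Place", True),
    "dp":      ("Person", "Place", True),
    "up":      ("Person", "Place", True),
}


def match_pattern(edge: str) -> str:
    """Return a Cypher query string for one edge type (hypothetical helper)."""
    source, target, directed = GEDCOM_EDGES[edge]
    # Directed edges use "->"; two-way edges use an undirected pattern.
    arrow = "->" if directed else "-"
    return f"MATCH (a:{source})-[:{edge}]{arrow}(b:{target}) RETURN a, b LIMIT 10"


if __name__ == "__main__":
    print(match_pattern("father"))
    print(match_pattern("husband"))
```

The resulting strings could be pasted into a query window once you have confirmed the actual labels and relationship types in your own database.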
- FTDNA Family Finder Matches: This file is downloaded from the FTDNA website. Its name has the format {kit number}_Family_Finder_Matches_{date}.csv. Please do not rename this file.
Nodes       Edges          Edge Links
FF_Kit      FF_KitMatch    FF_Kit→FF_Match
FF_Match
- FTDNA Chromosome Browser: This file is downloaded from the FTDNA website. Its name has the format {kit number}_Chromosome_Browser_Results_{date}.csv. Please do not rename this file.
Nodes         Edges            Edge Links
CB_Match      CB_KitMatch      FF_Kit→CB_Match
CB_Segment    match_segment    CB_Match→CB_Segment
CB_Region     match_region     CB_Match→CB_Region
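Because the FTDNA file names carry the kit number, it can help to check them before uploading. The sketch below parses the two name formats given above and sorts a batch of files so that each kit's Family Finder file comes before its Chromosome Browser file, following the suggested upload order. It is only an illustration: the function names and the sample kit numbers are hypothetical, and the date is kept as plain text because its exact format is not stated here.

```python
import re
from typing import List, NamedTuple, Optional

# Position of each file type within one kit's uploads:
# Family Finder first, then Chromosome Browser.
FILE_TYPES = {"Family_Finder_Matches": 0, "Chromosome_Browser_Results": 1}

PATTERN = re.compile(
    r"^(?P<kit>.+?)_"
    r"(?P<kind>Family_Finder_Matches|Chromosome_Browser_Results)_"
    r"(?P<date>.+)\.csv$"
)


class FtdnaFile(NamedTuple):
    kit: str
    kind: str
    date: str  # kept as text; the date format is an unknown here
    name: str


def parse_ftdna_filename(name: str) -> Optional[FtdnaFile]:
    """Return the parts of an FTDNA file name, or None if it was renamed."""
    m = PATTERN.match(name)
    if m is None:
        return None
    return FtdnaFile(m["kit"], m["kind"], m["date"], name)


def upload_order(names: List[str]) -> List[str]:
    """Sort recognised files by kit, Family Finder before Chromosome Browser."""
    parsed = [f for f in map(parse_ftdna_filename, names) if f is not None]
    parsed.sort(key=lambda f: (f.kit, FILE_TYPES[f.kind]))
    return [f.name for f in parsed]


if __name__ == "__main__":
    # Hypothetical kit numbers and dates, for illustration only.
    files = [
        "K2_Chromosome_Browser_Results_2020-09-01.csv",
        "K1_Family_Finder_Matches_2020-09-01.csv",
        "K1_Chromosome_Browser_Results_2020-09-01.csv",
    ]
    for name in upload_order(files):
        print(name)
```

A file that returns None from parse_ftdna_filename has probably been renamed and may not be recognized by the platform.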
- Repeat the Family Finder and Chromosome Browser uploads for each kit: Load your other available kits one at a time. You must load the Family Finder file for each kit. In some cases the Chromosome Browser file is not available; omitting it will not interfere with the analytics.
- Curated User File: Navigate to Reports > GEDCOM-FTDNA Linkage for Curation and download the Excel file. Enter the kit number of any uploaded kit into the FTDNA_Kit_Name column. Entering other known kit numbers will not affect the analytics but may help you work with their owners. Next, identify the matches you recognize that are in the submitted GEDCOM, and enter the record number (RN) from the GEDCOM on the row for each of these matches. When you have completed this process, save the file and upload it as you did the other files. You have now linked your traditional family tree to the DNA graph. Some node properties are also updated to enhance the schema. These links enable a set of analytics, beginning with phasing.
Nodes    Edges            Edge Links
None     Gedcom_DNA       Person→CB_Match
         Gedcom_FF_Kit    Person→FF_Kit
         Kit_CB_Match     FF_Kit→CB_Match
- Reports:
- Graphs:
- Comparisons: