The Research Database Complex (RDC) at Indiana University
On this page:
- System overview
- System information
- Working with ePHI research data
- System access
- Computing environment
- Transferring your files to the RDC
- Application development
The Indiana University Research Database Complex (
rdc.uits.iu.edu) supports research-related databases and data-intensive applications that require databases. The RDC supports Oracle and MySQL databases, and provides an environment (
rdcweb.uits.iu.edu) for database-driven web applications focusing on research.
The system runs Red Hat Enterprise Linux 5 (RHEL 5). User home directories reside on a network-attached storage (NAS) device. You have a 100 GB (default) disk quota, which is shared with Big Red II, Quarry, and Mason, if you have accounts on those systems.
Note: The RDC is strictly devoted to supporting research; it is not an instructional or classroom environment. If you need to use Oracle or Microsoft SQL Server in an instructional environment, see Database and web server access for instruction.
The RDC offers Oracle 11g Release 2 (version 184.108.40.206) and MySQL Enterprise Server (version 5.5.8 Advanced) database accounts, with a full suite of Oracle components that support:
Advanced Security Option (ASO): Provides data
encryption and strong authentication services to the Oracle
Application Express: Offers development and
deployment of secure applications through a rapid, web-application
development tool for use with Oracle databases
Large objects (LOBs): Lets you store and
manipulate large blocks of unstructured data, such as text, graphic
images, video clips, and sound waveforms, in binary or
Oracle Multimedia (formerly Oracle interMedia):
Provides a platform for a wide range of multimedia-intensive
Oracle Text: Indexes any document or textual
content to add fast, accurate retrieval of information to Internet
content management applications, e-business catalogs, news services,
and job postings; indexes content stored in file systems, databases,
or on the web
Oracle XML Database: Treats XML as a
native datatype in the database
Oracle XDK: Contains the basic building blocks
for reading, manipulating, transforming, and viewing XML documents,
whether on a file system or stored in a database
- Partitioning: Lets you split large tables and indexes into smaller, manageable components, without requiring changes to underlying applications
Oracle Data Mining (ODM): Provides a way to
access information buried in the data by creating models to find
hidden patterns in large, complex collections of data; embeds data
mining within the Oracle database; algorithms operate natively on
relational tables or views, eliminating the need to extract and
transfer data into other tools, applications, or servers
- Online Analytical Processing (OLAP): Offers in-database, advanced multidimensional analytic capabilities
JServer Java Virtual Machine: A Java virtual
machine (VM) that runs within the Oracle database server's address
- Oracle Database Java Packages: Classes for relational database management system (RDBMS) features
Note: The RDC maintenance window is the first Tuesday of each month, 8am-5pm. Notice of any emergency downtime will be posted at IT Notices.
|System configuration||Aggregate information||Per-node information|
|Machine type||Research database system|
|Operating system||Red Hat Enterprise Linux 5 (RHEL 5)|
|Memory model||Distributed and shared|
|CPUs||Intel Xeon E5620 2.40 GHz (HP)
Intel Xeon Quad Core 1.6 GHz (Dell)
|Nodes||3 Hewlett Packard DL 180 G6 Oracle servers
1 Hewlett Packard DL 180 G6 MySQL server
1 Dell 2950 Database Driven Web Services
|RAM||288 GB||72 GB (HP)
8 GB (Dell)
|RPeak||307.2 gigaFLOPS||76.8 gigaFLOPS|
|Database storage and file systems
Note: Before storing data on this system, make sure you understand the information in the Working with ePHI research data section (below).
|Oracle and MySQL storage is provided via RAID 6 volumes
(block-level striping with double distributed parity) on a fiber
channel array, currently 48 TB in size.
Home directories with 100 GB default quotas are provided via NAS over NFS.
Shared scratch space is hosted on the Data Capacitor II (DC2) file
system. The DC2 scratch directory is a temporary workspace. Scratch
space is not allocated, and its total capacity fluctuates based on
project space requirements. The DC2 file system is mounted on IU
research systems as
Note: The Data Capacitor II (DC2) high-speed,
high-capacity, storage facility for very large data sets replaces the
former Data Capacitor file system, which was decommissioned January 7,
2014. The DC2 scratch file system (
|Backup and purge policies||Incremental backups of the RDC Oracle databases occur
at various times between 1am and 6am, Sunday through Friday.
Full backups occur 1am-5am every Saturday. Backups are retained for 30 days.
Backups for MySQL database servers on the RDC are the responsibility of the user.
Working with ePHI research data
The Health Insurance Portability and Accountability Act of 1996 (HIPAA) established rules protecting the privacy and security of personal health data. The HIPAA Security Rule set national standards specifically for the security of protected health information (PHI) that is created, stored, transmitted, or received electronically (i.e., electronic protected health information, or ePHI). To ensure the confidentiality, integrity, and availability of ePHI data, the HIPAA Security Rule requires organizations and individuals to implement a series of administrative, physical, and technical safeguards when working with ePHI data.
Although you can use this system for processing or storing electronic protected health information (ePHI) related to official IU research:
- You and/or the project's principal investigator (PI) are
responsible for ensuring the privacy and security of that data, and
complying with applicable federal and state laws/regulations and
institutional policies. IU's policies regarding HIPAA compliance
require the appropriate Institutional Review Board (IRB) approvals and
a data management plan.
- You and/or the project's PI are responsible for implementing HIPAA-required administrative, physical, and technical safeguards to any person, process, application, or service used to collect, process, manage, analyze, or store ePHI data.
The UITS Advanced Biomedical IT Core provides consulting and online help for Indiana University researchers who need help securely processing, storing, and sharing ePHI research data. If you need help or have questions about managing HIPAA-regulated data at IU, contact the ABITC. For additional details about HIPAA compliance at IU, see HIPAA & ABITC and the Office of Vice President and General Counsel (OVPGC) HIPAA Privacy & Security page.
Important: Although UITS HIPAA-aligned resources are managed using standards surpassing official standards for managing institutional data at IU and are appropriate for storing HIPAA-regulated ePHI research data, they are not recognized by the IU Committee of Data Stewards as appropriate for storing institutional data elements classified as Critical that are not ePHI data. For help determining which institutional data elements classified as Critical are considered ePHI, see Which data elements in the classifications of institutional data are considered protected health information (PHI)?
The IU Committee of Data Stewards and the University Information Policy Office (UIPO) set official classification levels and data management standards for institutional data in accordance with the university's Management of Institutional Data policy. If you have questions about the classifications of institutional data, contact the appropriate Data Steward. To determine the most sensitive classification of institutional data you can store on any given UITS service, see the "Choosing an appropriate storage solution" section of At IU, which dedicated file storage services and IT services with storage components are appropriate for sensitive institutional data, including ePHI research data?
Requesting an RDC account
To request an account on an Indiana University research system, use the Account Management Service (AMS); see At IU, if I already have some computing accounts, how do I get others? Account availability depends on your eligibility. For eligibility information, see the "Research system accounts (all campuses)" section in What computing accounts are available at IU, and for whom?
You will receive confirmation in email when your RDC account is created.
Requesting an RDC database account
Your RDC account confirmation message will direct you to the RDC Database and Web Services Account Application for requesting an RDC database account.
Note: You can request a group RDC database account if your group has an established IU group account username, but you must provide the IU Network ID usernames of everyone who will be permitted to use the group database account. The person requesting the group database account will be considered the owner and responsible party for the account, and is expected to communicate system downtimes and other system information to the group.
You will receive confirmation in email when your database schema, instance, or server has been created. The confirmation notice ("Oracle/MySQL Welcome Letter") will include your database login credentials and information about connecting to your database. UITS recommends saving your "Oracle/MySQL Welcome Letter" so you can refer to it if you forget the database password. Otherwise, you will need to contact the UITS High Performance Systems (HPS) team to have your database password reset.
Connecting to your RDC database
For instructions on connecting to your RDC database, see On the Research Database Complex at IU, how do I access my Oracle or MySQL database?
The Bash shell
By default, new accounts on the RDC are assigned the Bourne-again shell (i.e., the Bash shell). When you log in, the Bash shell reads and executes commands from the following startup files (in this order):/etc/profile ~/.bash_profile ~/.bashrc
When you log out, the Bash shell reads and executes commands from
~ (tilde) represents your
home directory (e.g.,
~/.bash_profile is the
.bash_profile file in your home directory).
In the Bash shell:
- To display the value of an environment variable, on the command
VARNAMEwith the name of an environment variable (e.g.,
- To temporarily change the value of an environment variable and
make programs that use the variable aware of the new value, on the
VARNAMEwith the name of an environment variable (e.g.,
EDITOR) and replace
VALUEwith the desired value (e.g.,
vi). The value will remain changed until you log out from the system or exit the shell.
- To permanently change the value of an environment variable, add
export VARNAME=VALUEline to your
For example, to makeexport EDITOR=vi
viyour default text editor, add this line to your
After you save and exitsource ~/.bash_profile
~/.bash_profile, the new environment variable will take effect the next time you log into the system. To make your change take effect immediately, on the command line, enter:
Changing your shell
In addition to the Bash shell, the RDC supports the TC
tcsh), C (
csh), Korn (
and Bourne (
sh) shells. To change your shell on the RDC,
||Changes your shell only on the node on which you run it, and leaves the other nodes of the cluster unchanged|
||Prompts you with the shells available on the system, and changes your login shell system-wide within 15 minutes|
Transferring your files to the RDC
SCP: This command-line utility is included with
OpenSSH. Basic use is:
scp username@host1:~/file1 username@host2:~/file1_copy
For example, to copy a file from your home directory on your local computer (e.g.,scp ~/foo.txt firstname.lastname@example.org:~/foo.txt
~/foo.txt) to your home directory on the RDC, on the command line, enter (replace
usernamewith your IU Network ID username):
SFTP: Use a command-line or graphical SFTP client
to access, transfer, and manage your RDC files.
For example, to transfer a file using command-line SFTP from your home directory on your local computer (e.g.,$ sftp email@example.com firstname.lastname@example.org's password: Connected to rdc.uits.iu.edu. sftp> put ~/foo.txt Uploading foo.txt to /N/hd02/username/RDC/foo.txt foo.txt 100% 43MB 76.9KB/s 09:39 sftp> exit
~/foo.txt) to your home directory on the RDC, on the command line, enter (replace
usernamewith your IU Network ID username; at the
passwordprompt, enter your Network ID passphrase):
In addition to hosting research databases, the RDC provides an environment for developing database-driven web applications with a research focus. For details, see Web Services on the IU Research Database Complex.
Oracle: For further documentation, see the Oracle Database Documentation Library, 11g Release 2 (11.2), and the following online guides:
- Oracle Database New Features Guide
- Oracle Database Concepts
- Oracle Database Online Documentation Library Master Glossary
- Advanced Application Developer's Guide
- Oracle Database Reference
- SQL Language Reference
- PL/SQL Language Reference
- Application Express (ApEx) Documentation
- ApEx Developer's Guide
- ApEx Application Builder User's Guide
MySQL: For further documentation, see the MySQL Reference Manual.
The RDC is strictly devoted to supporting research. The RDC is not an instructional, classroom environment. If you are not doing research and wish to use a database, such as Oracle or Microsoft SQL Server, see Database and web server access for instruction.
For RDC usage policies, including information about your responsibilities for maintaining database security and acknowledging grant support, see Research Database Complex (RDC) usage policies.
- If you have system-specific questions about Big Red II, Quarry,
Mason, or the Research Database Complex (RDC), email the High
Performance Systems team.
- If you have questions about compilers, programming, scientific and
numerical libraries, or debuggers on a research computing system, email Scientific
Applications and Performance Tuning team.
- If you have questions about statistical and mathematical software
on any of the research computing systems, email the
Research Analytics group.
- If you have questions about scratch or project space on the Data
Capacitor II or Data Capacitor Wide Area Network (DC-WAN) file system,
email the High
Performance File Systems team.
- If you have questions about the Research File System (RFS) or Scholarly Data Archive (SDA), email the Research Storage team.
To ask any other question about Research Technologies systems and services, use the Request help or information form.