project-lib for R (deprecated)

If you need to interact with your Watson Studio projects and project assets from a notebook, you can use the project-lib library for R. The library is like a programmatical interface to a project.

By using the project-lib library for R, you can access project metadata and assets, including files and connections. The library also contains functions that simplify fetching files associated with the project.

The project-lib library for R is deprecated and has been replaced by the ibm-watson-studio-lib library for R. Although you can still use the project-lib library, you should start using the ibm-watson-studio-lib library in your notebooks. See ibm-watson-studio-lib for R.

Note:


- The project-lib functions do not encode or decode data when saving data to or getting data from a file.
- The project-lib functions can't be used to access connected folder assets (files on a path in Cloud Object Storage).

The project-lib functions

The instantiated project object that is created after you have imported the project-lib library exposes a set of functions that are grouped in the following way:

Fetch project information

You can use the following functions to fetch project-related information programmatically.

  • get_name()

    This function returns the name of the project.

  • get_description()

    This function returns the description of the project.

  • get_metadata()

    This function returns the project metadata.

  • get_files()

    This function returns a list of the files in your project. Each element in the returned list contains the ID and the name of the file. The list of returned files is not sorted by any criterion and can change when you call the function again.

  • get_assets()

    This function returns a list of all project assets. You can pass the optional parameter assetType to the function get_assets which allows you to filter assets by type. The accepted values for this parameter are data_asset, connection and asset. The value asset returns all of the assets in your project. For example, to get only the data assets, use the function get_assets("data_asset").

  • get_connections()

    This function returns a list of the connections you have in your project. Each element in the returned list contains the ID and the name of the connection.

Fetch files

You can use the following function to fetch files stored in your project.

  • get_file(filename) where filename is the name of the file you want to fetch.

    This function fetches a file into the memory of the running kernel. The function returns a byte buffer which can be used to bind into kernel-specific data structures, for example, an R data frame. This method of fetching files is not recommended for very large files.

    The following example shows you how to fetch a file and read the data into an R data frame:

    # Import project lib
    library("projectLib")
    project <- access_project()
    
    # Fetch data
    my.file <- project$get_file("my_file.csv")
    
    # Read the CSV data file into a data frame
    df.data <-  read.csv(text = rawToChar(my.file))
    head(df.data)
    

Save data

You can use the following function to save data as a file to the project. This function does multiple things. Firstly, it puts the data into a file and then it adds this data as a data asset to your project so you can see the data that you saved as a file in the data assets list in your project.

  • save_data(filename, data, setProjectAsset=TRUE, overwrite=FALSE)

    The function takes the following parameters:

    • filename: the name of the created file.

    • data: the data to upload. The accepted types for this parameter is R raw objects or string buffers.

    • setProjectAsset[optional]: adds the file to the project as a data asset after the data was successfully uploaded to the file. It takes a boolean value and the value true is set by default.

    • overwrite[optional]: overwrites the file if the file already exists. By default it is set to false.

      Here is an example, which shows you how you can save data to a file:

      library("projectLib")
      project <- access_project()
      
      # Capture CSV data from console output
      csv_lines <- capture.output(write.csv(df.data, row.names=FALSE), type="output")
      csv_raw <- charToRaw(paste0(csv_lines, collapse='\n'))
      
      project$save_data("file.csv", csv_raw)
      

Read data from a connection

You can use the following function to get the metadata (credentials) of a given connection.

get_connection: the function takes as input the ID of the connection or the name of the connection. You can get these values by using the get_assets() function which returns the id, name and type of all the assets listed in project.

The function get_connection returns the connection credentials which you can use to fetch data from the connection data source.

Here is an example, which shows you how you can fetch the credentials of a connection by using the get_connection function:

# Import project lib
library(projectLib)
project <- access_project()

# Fetch connection
conn.cred <- project$get_connection(name="<ConnectionName>")

If your connection is a connection to dashDB for example, you can fetch your data by running the following code:

library(ibmdbR)

props <- paste("DASHDB;DATABASE=BLUDB;HOSTNAME=", conn.cred$host, ";PORT=50000;PROTOCOL=TCPIP;", sep="")
conn <- idaConnect(props, uid = conn.cred$username, pwd = conn.cred$password, conType = "odbc")
idaInit(conn)

idf.1 <- ida.data.frame('<TableName>')
head(idf.1)

Fetch connected data

You can use the following function to fetch the credentials of connected data. The function returns a dictionary that contains the connection credentials in addition to a datapath attribute that points to specific data in that connection, for example, a table in a dashDB instance or a database in a Cloudant instance.

get_connected_data: this function takes as input the ID of the connected data or the name of the connected data. You can get these values by using the get_assets() function which returns the id, name and type of all the assets listed in project.

Here is an example, which shows you how to fetch the credentials of connected data in a dashDB instance by using the get_connected_data function:

# Import project lib
library(projectLib)
project <- access_project()

# Fetch credentials of the connected data
conn.data <- project$get_connected_data(id="<ConnectedDataId>")

Parent topic: Loading and accessing data in a notebook