Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-GPU #2114

Closed
wants to merge 15 commits into from
Closed

Multi-GPU #2114

wants to merge 15 commits into from

Commits on Jun 30, 2015

  1. Configuration menu
    Copy the full SHA
    a187629 View commit details
    Browse the repository at this point in the history
  2. Thread-local Caffe

    cypof authored and shelhamer committed Jun 30, 2015
    Configuration menu
    Copy the full SHA
    346e262 View commit details
    Browse the repository at this point in the history
  3. Changed the way threads are started and stopped

    - Interrupt the thread before waiting on join
    - Provide a method for looping threads to exit on demand
    - CHECK if start and stop succeed instead of returning an error
    cypof authored and shelhamer committed Jun 30, 2015
    Configuration menu
    Copy the full SHA
    baca07f View commit details
    Browse the repository at this point in the history
  4. Persistent prefetch thread

    cypof authored and shelhamer committed Jun 30, 2015
    Configuration menu
    Copy the full SHA
    f558693 View commit details
    Browse the repository at this point in the history
  5. Added DataReader for parallel training with one DB session

    - Makes sure each solver accesses a different subset of the data
    - Sequential reading of DB for performance
    - Prefetches a configurable amount of data to host memory
    - Distributes data to solvers in round-robin way for determinism
    cypof authored and shelhamer committed Jun 30, 2015
    Configuration menu
    Copy the full SHA
    d1cef66 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    b114e37 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    8ca8efc View commit details
    Browse the repository at this point in the history
  8. Multi-GPU

    - Split batches between GPUs, and tree-reduce the gradients
    - Detects machine topology (twin-GPU boards, P2P connectivity)
    - Inserts a callbak in the solver for minimal code change
    - Added gpus flags to caffe.cpp, e.g. '-gpus 0,1' or '-gpus all'
    - Deterministic architecture for reproducible runs
    cypof authored and shelhamer committed Jun 30, 2015
    Configuration menu
    Copy the full SHA
    45d30f4 View commit details
    Browse the repository at this point in the history
  9. Remove batch size division by number of devices

    Follow expectation of `iter_size` in compounding the effective batch
    size by the number of devices.
    cypof authored and shelhamer committed Jun 30, 2015
    Configuration menu
    Copy the full SHA
    eb2cfcc View commit details
    Browse the repository at this point in the history

Commits on Jul 27, 2015

  1. Added multi-gpu test

    cypof committed Jul 27, 2015
    Configuration menu
    Copy the full SHA
    100feb7 View commit details
    Browse the repository at this point in the history
  2. Tweaks to track device in syncedmem

    mhouston authored and cypof committed Jul 27, 2015
    Configuration menu
    Copy the full SHA
    12d3c24 View commit details
    Browse the repository at this point in the history
  3. Tweaks to walking the topology for corner cases

    mhouston authored and cypof committed Jul 27, 2015
    Configuration menu
    Copy the full SHA
    d127e11 View commit details
    Browse the repository at this point in the history
  4. Improve broadcast order, start with distant nodes

    mhouston authored and cypof committed Jul 27, 2015
    Configuration menu
    Copy the full SHA
    de3877b View commit details
    Browse the repository at this point in the history
  5. Workaround for running on default GPU if ID not specified

    mhouston authored and cypof committed Jul 27, 2015
    Configuration menu
    Copy the full SHA
    cd75ad1 View commit details
    Browse the repository at this point in the history
  6. Fix outside loop to loop for full tree depth

    mhouston authored and cypof committed Jul 27, 2015
    Configuration menu
    Copy the full SHA
    d3d8f6e View commit details
    Browse the repository at this point in the history