To provide a user-friendly system that can grasp different objects successfully using simple operation commands, this paper describes an approach that combines real-time, remote, vision-based teleoperation with autonomy for human-aided robotic grasping. During teleoperation, a Kinect tracks the human operator's motion in real time, detecting the positions of the shoulder, elbow and hand joints so that the robot can imitate the human. Hand gestures are recognized and used to activate autonomous grasping, which saves time and generates more natural grasping poses. In our system, the robot performs tasks such as picking up objects in response to simple commands, with the Kinect also serving as the object sensor. Experimental results show that the system is effective and user-friendly.
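As an illustration of how tracked joint positions could drive imitation, the following minimal sketch computes the elbow angle from the 3D positions of the shoulder, elbow and hand. It is not the method used in this paper: the function name `elbow_angle`, the tuple-based coordinate format, and the choice of a single joint angle as the imitation signal are all assumptions made for the example, and no Kinect SDK call is shown.

```python
import math


def elbow_angle(shoulder, elbow, hand):
    """Return the angle at the elbow, in radians, between upper arm and forearm.

    Each argument is a hypothetical (x, y, z) position in any consistent unit,
    such as the skeleton joint coordinates a depth sensor might report.
    """
    # Vectors pointing from the elbow towards the shoulder and towards the hand.
    upper_arm = [s - e for s, e in zip(shoulder, elbow)]
    forearm = [h - e for h, e in zip(hand, elbow)]

    dot = sum(u * f for u, f in zip(upper_arm, forearm))
    norm_upper = math.sqrt(sum(u * u for u in upper_arm))
    norm_fore = math.sqrt(sum(f * f for f in forearm))
    if norm_upper == 0.0 or norm_fore == 0.0:
        raise ValueError("joint positions coincide; angle is undefined")

    # Clamp to [-1, 1] to guard against rounding errors before acos.
    cosine = max(-1.0, min(1.0, dot / (norm_upper * norm_fore)))
    return math.acos(cosine)
```

A fully extended arm gives an angle of pi radians and a right-angle bend gives pi/2; a robot controller could, under these assumptions, map such an angle onto the corresponding joint of the manipulator.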